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This is a FIRST submission of items concerning a filing under 35 U S.C 371 
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A copy of the International Application as filed (35 U S C 371 (c) (2)) 

a. □ is attached hereto (required only if not communicated by the hitemational Bureau) 

b. El has been communicated by the International Bureau 

c. □ IS not required, as the application was filed in the United States Receiving Office (RO/US). 
An English language translation of the International Application as filed (35 U S C 371(c)(2)). 
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Amendments to the claims of the hitemational Application under PCT Article 19 (35 U S C. 371 (cX3)) 

a. □ are attached hereto (required only if not communicated by the International Bureau). 

b. □ have been communicated by the International Bureau 

c. □ have not been made, however, the time limit for making such amendments has NOT expired. 
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An English language translation of the amendments to the claims under PCT Article 19 (35 U S C 371(c)(3)). 
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□ Neither international prchininar\' examination fee (37 CVW I 482) nor 
international search fee (37 CFR 1 445(a)(2)) paid to USP LO 
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International- preliminary examination fee (37 CFR 1 482) not paid to 
USP'fO but International Search Report prepared by the F:P0 or JPO 
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but international search fee (37 CFR 1 445(a)(2)) paid to USPTO 

□ hitemational preliminary examination fee (37 CFR 1 482) paid to USPTO 
but all claims did not satisfy provisions of PCT Article 33(1 )-(4) 

□ .hitemational preliminary examination fee (37 CFR I 482) paid to USPTO 
and all claims satisfied provisions of PCT Article 33(1 )-(4) 
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^ Applicant claims small entity status See 37 CFR 1 27) The fees indicated above are 
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$445.00 



SUBTOTAL - 



$445.00 



Processing fee of $130.00 for fumishing the English translation later than 
months from the earliest claimed priority date (37 CFR 1 .492 (f)) 
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□ 30 
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TOTAL NATIONAL FEE 
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Fees are to be charged to a credit card WARNING: Information on this form may become public Credit card 
information should not be included on this form. Provide credit card information and authorization on PTO-2038 
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IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 

In re application of D. Goring and N. Silva 
U.S. Application No. Unassigned 
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Proline-rich Extensin-like Receptor Kinases 
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CERTIFICATE OF EXPRESS MAILING 

I hereby certify that this correspondence is being deposited with the United States Postal Service, "Express Mail 
Post Office to Addressee", Mailing Label No, EL 930922731 US addressed to: Commissioner for Patents, Box Patent 
Application, Washington, D.C. 20231 on Tuesday, February 19, 2002. 

facpn M. ^chlitter M 

Commissioner for Patents 
Box Patent Application 
Washington, D.C. 20231 



PRELIMINARY AMENDMENT UNDER 37 CFR §1.115 
REDUCING THE NUMBER OF CLAIMS PRIOR TO CALCULATION OF THE FILING FEE 



Sir: 



Applicants request entry of the following amendments. 
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In the Description 

Page 1, after line 2, please insert the following paragraph. 

—This is a continuation-in-part of international Application No. 
PCT/CAOO/00966, filed August 18, 2000, which claims priority to U.S. provisional 
Application No. 60/149,466, filed August 19, 1999, and to U.S. provisional 
Application No. 60/159,122, filed October 13, 1999.-- 

In the Claims 

Please cancel Claims 15 to 18, 22, 24, 26, 27, and 29 to 47 without prejudice. 
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Respectfully submitted. 




Gene J. Yao, Esquire 
Reg. No. 47,193 
Attomey for Applicants 
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SECOND PRELIMINARY AMENDMENT UNDER 37 CFR 6 1.115 



Sir: 



Applicants request entry of the following amendments. 
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In the Claims 

6. (Amended) An isolated nucleic acid molecule encoding a PERX polypeptide, a 
fragment of a PERK polypeptide having PERK activity, or a polypeptide 
having PERK activity, comprising a nucleic acid molecule selected from the 
group consisting of: (a) a nucleic acid molecule that hybridizes to a nucleic 
acid molecule consisting of [SEQ ID NO:l], or a complement thereof under 
low, moderate or high stringency hybridization conditions wherein the nucleic 
acid molecule encodes a PERK polypeptide or a polypeptide having PERK 
activity; and (b) a nucleic acid molecule degenerate with respect to (a), wherein 
the nucleic molecule encodes a PERK polypeptide or a polypeptide having 
PERK activity. 

9. (Amended) The nucleic acid molecule of claim 1, wherein the PERK polypeptide 
comprises a PERKl polypeptide, 

14. (Amended) A recombinant nucleic acid molecule comprising a nucleic acid molecule 
of claim 1 and a constitutive promoter sequence or an inducible promoter sequence, 
operatively linked so that the promoter enhances transcription of the nucleic acid 
molecule in a host cell. 

19. (Amended) A vector comprising the nucleic acid molecule of claim 1. 

23. (Ameneded) A plant, a plant part, a seed, a plant cell or progeny thereof comprising 
the recombinant nucleic acid molecule of claim 14. 

28. (Amended) An isolated polypeptide encoded by and/or produced from the nucleic 
acid molecule of claim 1. 
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REMARKS 



Claims 15 to 18, 22, 24, 26, 27, and 29 to 47 have been canceled without 
prejudice in a first Preliminary Amendment. Amendments have been made to Claims 
9, 14, 19, and 28 to convert the claims from multiple-dependent form. An amendment 
of an editorial nature has been made to Claim 6. Support for the amendment to Claim 
23 is found in the application at page 5, lines 28 and 29. No new matter has been 
added. Claims presently pending are Claims 1 to 14, 19 to 21, 23, 25, and 28. 

A marked-up version of the claims amended, showing the changes made, is 
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attached. 



Gene J. Yao, Esquire 
Reg. No. 47,193 
Attorney for Applicants 
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VERSION WITH MARKINGS TO SHOW CHANGES MADE 

(Amended) An isolated nucleic acid molecule encoding a PERK polypeptide, a 
fragment of a PERK polypeptide having PERK activity, or a polypeptide 
having PERK activity, comprising a nucleic acid molecule selected from the 
group consisting of: [(b)] {a} a nucleic acid molecule that hybridizes to a 
nucleic acid molecule consisting of [SEQ ID NO: 1], or a complement thereof 
under low, moderate or high stringency hybridization conditions wherein the 
nucleic acid molecule encodes a PERK polypeptide or a polypeptide having 
PERK activity; and (b) a nucleic acid molecule degenerate with respect to (a), 
wherein the nucleic molecule encodes a PERK polypeptide or a polypeptide 
having PERK activity. 



9. (Amended) The nucleic acid molecule of [any of claims 1 to 8] claim 1 . wherein the 
PERK polypeptide comprises a PERKl polypeptide. 

14. (Amended) A recombinant nucleic acid molecule comprising a nucleic acid molecule 
of [any of claims 1 to 4] claim 1 and a constitutive promoter sequence or an inducible 
promoter sequence, operatively linked so that the promoter enhances transcription of 
the nucleic acid molecule in a host celL 



19. (Amended) A vector comprising the nucleic acid molecule of [any one of claims 1 to 
4] claim 1 . 



21. 



(Amended) A host cell comprising the recombinant nucleic acid molecule of claim 1 
[or the vector of claim 19], or progeny of the host cell. 
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23. (Ameneded) A plant, a plant part, a seed, a plant cell or progeny thereof comprising 
the recombinant nucleic acid molecule of [claim 15 or the vector of claim 19] claim 
H. 

28. (Amended) An isolated polypeptide encoded by and/or produced from the nucleic 
acid molecule of [any of claims 1 to 4, or the vector of claim 19] claim 1 . 
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Examiner NYA 
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CERTIFICATE OF MAILING 

I hereby certify that this correspondence is being deposited with the United States Postal Service 
with sufficient postage as first class mail in an envelope addressed to the Commissioner for Patents, 
Washington, D.C. 20231, on Wednesday, June 12, 2002. 




Commissioner for Patents 
Washington, D.C. 20231 



THIRD PRELIMINARY AMENDMENT UNDER 37 CFR § L115 



Sir: 



Applicants request entry of the following amendment under 37 CFR 
§1.1 15(b)(2). The stams of the present application is that applicants are awaiting 
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receipt of a first Action on-the-merits. 

Applicants note that a copy of the present Amendment was originally filed on 
February 22, 2002. However, according to the PTO, that copy was not successfully 
matched with the present application. Applicants request, therefore, that the present 
copy serve as the official copy of the present Amendment and that the PTO ignore 
the copy mailed February 22, 2002. 

In the Description 

Please delete the paragraph added by the first Preliminary Amendment, filed 
February 19, 2002, (if already added to the specification, the paragraph starting at 
page 1, line 3) and add the following paragraph before the heading "Field of the 
Invention". 

—This is a U.S. national stage application based on international Application 
No. PCT/CAOO/00966, filed August 18, 2000, which claims priority to U.S. 
provisional Application No. 60/149,466, filed August 19, 1999, and to U.S. 
provisional Application No. 60/159,122, filed October 13, 1999.- 

REMARKS 

The above amendment has been made to correct a clerical error made in a 
first Preliminary Amendment, filed February 19, 2002. In that Amendment, the 
present application was misidentified as being a continuation-in-part of international 
Application No. PCT/CAOO/00966 when, in fact, it is a national stage application 
based thereon with no new matter added. Applicants confirm that no new matter 
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has been added by the first and second Preliminary Amendments which were filed 
on February 19, 2002. No new matter has been added by the present amendment. 
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CERTIFICATE OF MAILING 

I hereby certify that this correspondence is being deposited with the United States Postal Service 
with sufficient postage as first class mail in an envelope addressed to the Commissioner for Patents, 
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Washington, D.C. 20231 

FOURTH PRELIMINARY AMENDMENT UNDER 37 CFR $1.115 



Sir: 

Applicants request entry of the following amendments. 



In the Description 



Group No. NYA 
Examiner NYA 
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Please amend the paragraph, commencing at page 13, line 14, as follows. 



Figure II* Shows the nucleic acid molecules of [SEQ ID NOS:6 & 7] and the amino 
acid sequence of [SEQ ID NO: 8]. 

In a preferred embodiment, the figure shows the sequence of the predicted 
Arabidopsis gene - Accession number AAC98010 

A) Genomic Sequence [SEQ ID NO: 6]. The predicted open reading frame is 
underlined. The start codon (ATG) and stop codon (TGA) are double underlined. 

B) Translation of the predicted open reading frame [SEQ ID NOS:7 & 8]. The 
transmembrane domain is underlined. 



Please amend the paragraph, commencing at page 13, line 22, as follows. 

Figure 12. Shows the nucleic acid molecules of [SEQ ID NOS:9 & 10] and the 
amino acid sequence of [SEQ ID NO:l 1]. 

In a preferred embodiment, the figure shows the sequence of the predicted 
Arabidopsis gene - Accession number AAD 15491 

A) Genomic Sequence [SEQ ID NO:9]. The predicted open reading fi-ame is 
underlined. The start codon (ATG) and stop codon (TGA) are double underlined. 

B) Translation of the predicted open reading frame [SEQ ID NOS:10 & 1 1]. The 
transmembrane domain is underlined. 



Please amend the paragraph, commencing at page 13, line 30, as follows. 

Figure 13. Shows the nucleic acid molecules of [SEQ ID NOS:12 & 13] and the 
amino acid sequence of [SEQ ID NO: 14]. 

In a preferred embodiment, the figure shows the sequence of the predicted 
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Arabidopsis gene - Accession number CAA18823. 

A) Genomic Sequence [SEQ ID NO: 12]. The predicted open reading frame is 
underlined. The start codon (ATG) and stop codon (TGA) are double underlined. 

B) Translation of the predicted open reading frame [SEQ ID NOS:13 & 14]. The 
transmembrane domain is underlined. 

Please amend the paragraph, commencing at page 14, line 7, as follows. 

Figure 14. Shows the nucleic acid molecules of [SEQ ID NOS: 1 5 & 16] and the 
amino acid sequence of [SEQ ID NO: 17]. 

In a preferred embodiment, the figure shows the sequence of the predicted 
Arabidopsis gene - Accession number CAA 18590 

A) Genomic Sequence [SEQ ID NO: 15]. The predicted open reading frame is 
underlined. The start codon (ATG) and stop codon (TGA) are double underlined. 

B) Translation of the predicted open reading frame [SEQ ID NOS: 15 & 16]. The 
transmembrane domain is underlined. 

Please amend the paragraph, commencing at page 43, line 12, as follows. 

The isolation of novel Brassica napus receptor kinases relied upon the newly 
constructed cDNA library and involved in vivo mass excision of the pBluecsript 
phagemids from the Uni-ZAP XR vectors as outlined by the manufacturer (Stratagene, 
La Jolla, CA). Following efficient mass excision, phagemid DNA was extracted using 
a large scale alkaline protocol as described by Sambrook et al. (1989) and subjected to 
the polymerase chain reaction (PGR) using two separate oligonucleotide 
combinations, RK1/RK2 and RK1/RK3 (obtained from M. Cock, Ecole Normale 
Superieure de Lyon, France) specifically designed to prime conserved subdomains of 
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the catalytic domain of receptor protein kinases. RKl (5- 

ggiggTTTCggiAT^c^gTiTT^/cAA^gg - 3'; [SEQ ID NO: 18]) served as the forward 
primer and was constructed based upon a conserved amino acid consensus 
(GGFGIV%KG; [SEQ ID NO: 19]) within subdomain I of the catalytic domain. The 
degeneracy of one reverse primer RK2 (5' - AAiATiCygCCATiCC^gAA^/c -3'; 
[SEQ ID NO:20]) reflects a conserved amino acid consensus (DFGMARIF; [SEQ ID 
N0:21]) of subdomain VII which closely resembles the SRKs in Brassica. The 
second reverse oligonucleotide RK3 (5' - A^AiA^cTTigCiA^giCC^gAA^gTC - 3'; 
[SEQ ID NO:23]) was generated based upon conserved amino acids (DFGLAKLL; 
[SEQ ID NO:24]) within subdomain VII prevalent among the RLKs isolated in 
Arabidopsis. Phagemid DNA was amplified in a reaction mixture containing 1 
microliter of excised phagemid DNA, lOx PGR buffer (lOOmM Tris-HCl pH8.3, 
500mM KCl, 15mM MgCl2), lOmM deoxyribonucleotide triphosphate mixture, 10 
micromolar of each oligonucleotide primer and 0.5 microliter Tsg polymerase 
(BioBasics, Canada). The PGR reaction was heated at 95''C for 2 min and amplified 
for 35 cycles under the following amplification conditions: 1 min at 95^C for 
denaturation, 1 min 30 sec at 50*^0 for primer annealing and 1 min at 72°C for 
synthesis. A final extension cycle of 10 min at 72*^0 was also incorporated into the 
amplification program. All PGR products generated of the expected size (420-450 
bp) were gel purified, cloned into the pTTBlue plasmid (Novagen, Madison, WI) and 
introduced into Escherichia coli DH5- alpha. Transformants were tested for the 
presence of an insert and positive clones were sequenced with universal primers (R-20 
and U-19) by an. ABI automated sequencer (Model 373 STRETGH DNA; Perkin 
Elmer Gorp., Ganada Ltd.) using the dideoxychain-terminating method described by 
Sanger et al. (1977). Sequence analyses performed using DNAsis® software (Hitachi 
Software, San Bruno, GA) at the nucleotide and amino acid levels. 
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Please amend the paragraph, commencing at page 45, line 5, as follows. 

The 5' end of the PERKl cDNA was obtained by the procedure for the rapid 
amplification of cDNA ends originally described by Frohman et al. (1988) using the 5' 
RACE System, Version 2.0 kit (Gibco-BRL, Gainthersburg, MD). First strand cDNA 
was synthesized from approximately 300:g of mixed Westar and Wl pistil total RNA 
using a gene specific primer GSPl (5'-TAACCAACAAgAgACA-3'; [SEQ ID 
NO:22]) designed to anneal approximately 300 bp from the 5' end of the PERKl 
cDNA (1512 bp) isolated from the library screen. Following cDNA synthesis, the first 
strand product was purified from unincorporated dNTPs and GSPl using a GLASS 
MAX® spin cartridge. A homopolymeric tail was added to the 3' end of the cDNA 
using TdT (terminal deoxynucleotidyl transferase) and dCTP. Tailed cDNA was 
amplified using a second gene specific primer GSP2 (5'-CCACTCCCAACTTTCAAC 
-3'; [SEQ ID NO:25]) designed to anneal 3' to GSPl with respect to the cDNA, and 
an abridged anchor primer (Gibco-BRL, Gainthersburg, MD) which annealed to the 
homopolymeric tail. PGR ampHfication was carried out for 35 cycles of denaturation 
at 94^C for 1 min, primer annealing at 55''C for 1 min and extension at 72^C for 2 min, 
followed by a final extension cycle for 10 min. A PGR product of the expected size 
(-1 kb) corresponding to the 5'end of PERKl was gel purified, cloned into the 
pTTBlue plasmid (Novagen, Madison, WI) and transformed into Escherichia coli 
DH5- alpha. Confirmation of the 5 'RACE product was obtained by plasmid Southern 
blot analysis as described above and by sequential primer based sequencing. 

The paragraph, commencing at page 45, line 25, has been amended as follows. 

A PGR based approach was used to generate a fiiU length PERKl cDNA by 
combining the 5 'RACE product cloned into the EcoRV site of pT7Blue with the 
cDNA isolated from the library screen cloned into the EcoRI/XhoI sites of the 



U.S. Application No. 10/069,304 July 29, 2002 

Attorney Docket No. P 25,762 USA 

Page 6 

pBluescript SK phagemid. A forward primer (5'- 

ggAAAgCTTgCATgCCTgCAggTCgAC -3'; [SEQ ID NO:26]) containing an internal 
PstI site was designed to anneal upstream to the EcoRV cloning site of pTTBlue. A 
reverse primer (5'-CgCCTgCAggTAATACgACTCACTATAggg -3'; [SEQ ID 
NO:27]) also containing a PstI site was designed based on pBluecsript phagemid 
sequence immediately 3' to the EcoRI/XhoI cloning site. Full length PERKl cDNA 
was generated from a 100 microliter PCR reaction containing 1 microliter (~20ng) of 
each template (cDNA in pTTBlue and pBluescript phagemid), lOx Pfu Buffer 
(200mM Tris-HCl pH8.8, lOOmM (NH4)2S04, 20mM MgS04, 1% Triton®X-100, 
Img/mlBSA), lOmM dNTPs, SOpmol forward and reverse primers and Imicroliter Pfu 
polymerase (Gibco-BRL, Gainthersburg, MD). The samples were heated to 94°C for 
5 min and amplified for 30 cycles with a denaturing cycle of 1 min, a pimer annealing 
cycle at SS^'C for 1 min followed by an extension cycle for 3 min at 72°C. The 
resulting PCR product of the expected size (~2.2kb) was gel purified and cloned into 
the PstI restriction site of pBluescript KS (V ) II. The full length PERKl cDNA 
sequence was confirmed by a sequential primer based sequencing approach using both 
universal and sequence specific primers as previously described. All DNA and protein 
sequence analysis was performed using the DNAsis® Software (Hitachi Software, San 
Bruno,CA). 

In the Drawings 

Please add the attached Figures 1(c), 1(d), and 1(e). 



REMARKS 

The status of the present application is that applicants have not yet received a 
first Action on-the-merits. Amendments have been made to the descriptive portion of 
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the specification to bring it in conformity with a Sequence Listing being submitted 
concurrently. In addition. Figures 1(c), 1(d), and 1(e) were added. These figures were 
inadvertently left out of the present application but were included with U.S. 
provisional Application No. 60/159,122 which is incorporated into the present 
application by reference (see page 48 of the present application). No new matter has 
been added by the present amendment. 

A marked-up version of the changes made to the specification by the current 
amendment is attached. 



Synnestvedt & Lechner LLP 
Suite 2600, Aramark Tower 
1101 Market Street 
Philadelphia, PA 19107-2950 
(215) 923-4466 - Telephone 
(215) 923-2189 - Facsimile 



Submitted respectfully. 




Gene J. Yao 



Reg. No. 47,193 
Attomey for Applicants 
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VERSION WITH MARKINGS TO SHOW CHANGES MADE 
In the Description 

The paragraph, commencing at page 13, line 14, has been amended as follows. 

Figure 11. Shows the nucleic acid molecules of FSEO ID NOS:6 & 7] [[SEQ ID NO.: 
6]] and the amino acid sequence of [SEQ ID NO:8] [[SEQ ID NO.: 7]]. 

In a preferred embodiment, the figure shows the sequence of the predicted 
Arabidopsis gene - Accession number AAC98010 

A) Genomic Sequence [SEQ ID NO: 6] . The predicted open reading frame is 
underlined. The start codon (ATG) and stop codon (TGA) are double underlined. 

B) Translation of the predicted open reading frame [SEQ ID NOS:7 & 8] . The 
transmembrane domain is underlined. 

The paragraph, commencing at page 13, line 22, has been amended as follows. 

Figure 12. Shows the nucleic acid molecules of [SEQ ID NOS:9 & 10] [[SEQ ID 
NO.: 8]] and the amino acid sequence of [SEQ ID NO: 11] [[SEQ ID NO.: 9]]. 

In a preferred embodiment, the figure shows the sequence of the predicted 
Arabidopsis gene - Accession number AAD 15491 

A) Genomic Sequence [SEQ ID NO:91 . The predicted open reading frame is 
underlined. The start codon (ATG) and stop codon (TGA) are double underlined. 
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B) Translation of the predicted open reading frame [SEP ID NOS:10 & 1 11 . The 
transmembrane domain is underlined. 

The paragraph, commencing at page 13, line 30, has been amended as follows. 

Figure 13, Shows the nucleic acid molecules of [SEP ID NOS:12 & 131 [[SEQ ID 
NP.: 10]] and the amino acid sequence of [SEP ID NP:141 [[SEQ ID NP.: 1 1]]. 

In a preferred embodiment, the figure shows the sequence of the predicted 
Arabidopsis gene - Accession number CAA18823- 

C) Genomic Sequence [SEP ID NP:121 . The predicted open reading frame is 
underlined. The start codon (ATG) and stop codon (TGA) are double underlined. 

D) Translation of the predicted open reading frame [SEP ID NPS:13 & 141 . The 
transmembrane domain is underlined. 

The paragraph, commencing at page 14, line 7, has been amended as follows. 

Figure 14. Shows the nucleic acid molecules of [SEP ID NPS:15 & 161 [[SEQ ID 
NP.: 12]] and the amino acid sequence of [SEP ID NP:171 [[SEQ ID NP.: 13]]. 

In a preferred embodiment, the figure shows the sequence of the predicted 
Arabidopsis gene - Accession number CAA18590 

A) Genomic Sequence [SEP ID NP:151 . The predicted open reading frame is 
underlined. The start codon (ATG) and stop codon (TGA) are double underlined. 

B) Translation of the predicted open reading frame [SEP ID NPS: 15 & 161 . The 
transmembrane domain is underlined. 
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The paragraph, commencing at page 43, line 12, has been amended as follows. 

The isolation of novel Brassica napus receptor kinases relied upon the newly 
[constructedcDNA] constructed cDNA library and involved in vivo mass excision of 
the pBluecsript phagemids from the Uni-ZAP XR vectors as outlined by the 
manufacturer (Stratagene, La Jolla, CA). Following efficient mass excision, phagemid 
DNA was extracted using a large scale alkaline protocol as described by Sambrook et 
al. (1989) and subjected to the polymerase chain reaction (PCR) using two separate 
oligonucleotide combinations, RK1/RK2 and RK1/RK3 (obtained from M. Cock, 
Ecole Normale Superieure de Lyon, France) specifically designed to prime conserved 
subdomains of the catalytic domain of receptor protein kinases. RKl (5- 
ggiggTTTCggiAT^c^gTiTTVcAA^ggg - 3^ : TSEO ID NO: 181 ) served as the forward 
primer and was constructed based upon a conserved amino acid consensus 
rGGFGIV%KG : [SEP ID NO: 191) within subdomain I of the catalytic domain. The 
degeneracy of one reverse primer RK2 (5* - AAiATiCygCCATiCC%AA^/c - 3'; 
[SEP ID NO:201 ) reflects a conserved amino acid consensus (DFGMARIF : [SEP ID 
NO:21] ) of subdomain VII which closely resembles the SRKs in Brassica. The 
second reverse oligonucleotide RK3 (5' - AVA^^cTTigCiA^giCC^gAA^JC - 3'; 
[SEP ID NO:231) was generated based upon conserved amino acids (DFGLAKLL; 
[SEP ID NP:24];> within subdomain VII prevalent among the RLKs isolated in 
Arabidopsis. Phagemid DNA was amplified in a reaction mixture containing 1 
microliter of excised phagemid DNA, lOx PCR buffer (lOOmM Tris-HCl pH8.3, 
500mM KCl, 15mM MgCl2), lOmM deoxyribonucleotide triphosphate mixture, 10 
micromolar of each oligonucleotide primer and 0.5 microliter Tsg polymerase 
(BioBasics, Canada). The PCR reaction was heated at 95*^0 for 2 min and amplified 
for 35 cycles under the following amplification conditions: 1 min at 95°C for 
denaturation, 1 min 30 sec at 50®C for primer annealing and 1 min at 72°C for 
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synthesis. A final extension cycle of 10 min at 72^C was also incorporated into the 
amplification program. All PGR products generated of the expected size (420-450 
bp) were gel purified, cloned into the pTTBlue plasmid (Novagen, Madison, WI) and 
introduced into Escherichia coli DH5- alpha. Transformants were tested for the 
presence of an insert and positive clones were sequenced with universal primers (R-20 
and U-19) by an ABI automated sequencer (Model 373 STRETCH DNA; Perkin 
Elmer Corp., Canada Ltd.) using the dideoxy chain-terminating method described by 
Sanger et al. (1977). Sequence analyses performed using DNAsis® software (Hitachi 
Software, San Bruno, CA) at the nucleotide and amino acid levels. 

The paragraph, commencing at page 45, line 5, has been amended as follows. 

The 5' end of the PERKl cDNA was obtained by the procedure for the rapid 
amplification of cDNA ends originally described by Frohman et al. (1988) using the 5' 
RACE System, Version 2.0 kit (Gibco-BRL, Gainthersburg, MD). First strand cDNA 
was synthesized fi-om approximately 300:g of mixed Westar and Wl pistil total RNA 
using a gene specific primer GSPl (5^-TAACCAACAAgAgACA-3 ^ [SEP ID 
NO:22] ) designed to anneal approximately 300 bp from the 5' end of the PERKl 
cDNA (1512 bp) isolated from the library screen. Following cDNA synthesis, the first 
strand product was purified from unincorporated dNTPs and GSPl using a GLASS 
MAX® spin cartridge. A homopolymeric tail was added to the 3' end of the cDNA 
using TdT (terminal deoxynucleotidyl transferase) and dCTP. Tailed cDNA was 
amplified using a second gene specific primer GSP2 (5'-CCACTCCCAACTTTCAAC 
-3 \ [SEP ID NO:251 ^ designed to anneal 3' to GSPl with respect to the cDNA, and 
an abridged anchor primer (Gibco-BRL, Gainthersburg, MD) which annealed to the 
homopolymeric tail. PGR amplification was carried out for 35 cycles of denaturation 
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at 94**C for 1 min, primer annealing at SS^'C for 1 min and extension at ll'^C for 2 min, 
followed by a final extension cycle for 10 min. A PGR product of the expected size 
(—1 kb) corresponding to the 5 'end of PERK 1 was gel purified, cloned into the 
pT7Blue plasmid (Novagen, Madison, WI) and transformed into Escherichia coli 
DH5- alpha. Confirmation of the 5 'RACE product was obtained by plasmid Southern 
blot analysis as described above and by sequential primer based sequencing. 

The paragraph, commencing at page 45, line 25, has been amended as follows. 

A PCR based approach was used to generate a full length PERKl cDNA by 
combining the 5 'RACE product cloned into the EcoRV site of pTTBlue with the 
cDNA isolated from the library screen cloned into the EcoRI/XhoI sites of the 
pBluescript SK phagemid. A forward primer (5'- 

ggAAAgCTTgCATgCCTgCAggTCgAC -3' : [SEO ID NO:261) containing an intemal 
PstI site was designed to anneal upstream to the EcoRV cloning site of pTTBlue. A 
reverse primer (5'-CgCCTgCAggTAATACgACTCACTATAggg -3' : [SEP ID 
NO:27] ^ also containing a PstI site was designed based on pBluecsript phagemid 
sequence immediately 3' to the EcoRI/XhoI cloning site. Full length PERKl cDNA 
was generated from a 100 microliter PCR reaction containing 1 microliter (~20ng) of 
each template (cDNA in pTTBlue and pBluescript phagemid), lOx Pfu Buffer 
(200mM Tris-HCl pH8.8, lOOmM (NH4)2S04, 20mM MgS04, 1% Triton®X-100, 
Img/mlBSA), lOmM dNTPs, 50pmol forward and reverse primers and 1 microliter Pfu 
polymerase (Gibco-BRL, Gainthersburg, MD). The samples were heated to 94**C for 
5 min and amplified for 30 cycles with a denaturing cycle of 1 min, a pimer annealing 
cycle at 53^C for 1 min followed by an extension cycle for 3 min at 72*'C. The 
resulting PCR product of the expected size ('-2.2kb) was gel purified and cloned into 
the PstI restriction site of pBluescript KS CO H. The full length PERKl cDNA 
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sequence was confirmed by a sequential primer based sequencing approach using both 
universal and sequence specific primers as previously described. All DNA and protein 
sequence analysis was performed using the DNAsis® Software (Hitachi Software, San 
Bruno,CA). 
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-95 ttaactctctgrgrtctccgrtgrtctcctctcttctcctgrctgrcttccttttaacactctctt -36 
- 3 5 ca 1 1 tgcc 1 1 1 1 tga 1 1 tagra t ccaaagraagrcagracATGTCCTCGGCGCCGTCTCCGGGG 2 4 

MSSAPSPG 
2 5 ACTGGTTCGCCTCCTVTCTCCACCATCy^AACTCCACAACCy^CCACTCC^ 84 

TGSPPSPPSN*STTTTPPPAS 
8 5 GCTCCTCCTCCCACCACACCTTCTTCTCCTCCGCCGCCy^TCCACTATTCCGACATCTCCT 144 
APPPTTPSSPPPPSTIPTSP 
145 CCTCCTTCTTCTCGCTCTACACCTTCTGCTCCTCCTCCATCTCCACCAACTCCATCTACG 2 04 

PPSSRSTPSAPPPSPPTPST 
205 CCGGGATCTCCACCTCCTCTTCCTCAGCCGTCTCCACCCGCTCCAACTACGCCCGGATCT 264 

PGSPPPLPQPSPPAPTTPGS 
265 CCACCCGCACCTGTTACTCCTCCTACTCGAAACCCTCCACCTTCAGTCCCAGGACCACCG 324 

PPAPVTPPTRNPPPSVPGPP 
325 TCCAATCCTTCACGCGAAGGAGGATCTCCTCGACCTCCATCTTCTCCCTCGCCGCCGTCT 3 84 

SNPSREGGSPRPPSSPSPPS 
385 CCTTCTTCCGACGGTTTATCTACAGG AGTGGTGGTGGGAATCGCCT^TCGGAGGAGT 444 

PSSDGLSTGVVVGIAIGGVA 
445 CTGCTTGTGATAGTGACTCTGATTTGTCTCCTCTGTA AGAAGAAACGACGGAGAGACGAA 504 

LLVIVTLICLLCKKKRRRDE 
505 GAAGATGCTTACTATGTTCCTCCGCCACCTCCTCCTGGTCCCAAAGCCGGAGGACCTTAC 564 

EDAYYVPPPPPPGPKAGGPY 
565 GGTGGACAGCAGCAACAATGGCGGCAACAAAACGCAACACCACCGTCAGATCATGTCGTG 624 

GGQQQQWRQQNATPPSDHVV 
625 ACGTCACTACCACCACCACCTAAGGCTCCTITCTCCACCACGGCAACCTCCTCCACCTCC^ 684 
TSLPPPPKAPSPPRQPPPPP 

FIG. 1C(1) 
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685 CCACCGCCTTTCATGAGCAGCAGCGGCGGCTCCGACTACTCGGACCGTCCAGTTCTTCCT 

PPPFMSSSGGSDYSDRPVLP 
74 5 CCACCGTCTCC;VGGGCTTGTGTTAGGCTTCTCa\AAAGCACTTTCACATACGAGGAGCTA 

PPSPGLVLGFSKSTFTYEEL 
805 GCTAGAGCC7VCCAATGGTTTCTCCGAGGCGAACTTGTTAGGACAAGGCGGGTTCGGTTAC 

A R A T N ^ GFSEANIiLGQGGFGY 
865 GTGCACAAAGGTGTGTTGCCTAGTGGGAAAGAAGTTGCTGTGAAGCAGTTGAAAGTTGGG 
V H K G V L P S " gKE VAVKQLKVG 
925 AGTGGTCAGGGAGAGAGGGAGTTTCT^GGCTVGAGGTTGAGATCATCAGCA^^ 

S G "^ Q GEREFQAEVEI ISRV ^^ H H 

985 AGGCATCTGGTGTCTCTTGTTGGTTATTGCATCGCCGGTGCCAAAAGATTGCTTGTCTAT 
RHLVSLVGYCIA ^ GAKRLLVY 
104 5 GAGTTTGTTCCTAACAACAATCTCGAGCTTCACCTCCATGGCGAGGGACGGCCTACAATG 1104 

EFVPNNNLELHLHGEGR ^ P T M 
1105 GAATGGAGCACCAGATTGAAGATTGCTCTTGGATCTGCTAAAGGACTTTCTTATCTTCAT 1164 

EWSTRLKIALGSAKGLSYLH 
1165 GAAGATTGCAATCCTAAAATCATTCACCGTGATATCAAGGCTTCAAACATATTGATAGAT 1224 

EDCNPKI IHRDIKASNILID 
1225 TTCAAGTTTGAAGCTAAGGTTGCTGATTTTGGTCTTGCr AAGATTGCTTCTGATACAAAC 1284 

F K F ^" e AKVADFGLAKIASDTN 
1285 ACGCATGTATCAACACGTGTGATGGGAACCTTTGGGTACTTGGCTCCGGAATACGCTGCA 1344 

T H V ^" S TRVMGTFGYIiAPEY ^A_A 
1345 AGCGGAAAGCTCACGGAGAAGTCTGACGTTTTCTCATTTGGCGTTGTGCTTTTGGAGCTC 1404 
SGKLTEKSDVFSFGVVLLEL 

FIG. 1C(2) 



1405 ATTACTGGACGTCGACCCGTTGATGCCAACAATGTCTATGTAGATGACAGCTTAGTTGAC 14 64 

ITGRRPVDANN YVDDSLVD 
1465 TGGGCACGACCATTGCTTAACCGAGCATCTGAGCAAGGAGACTTTGAGGGTTTAGCTGAT 1524 

WARPLLNRASEQGDFEGLAD 
1525 GCAAAGATGAATAATGGGTATGACAGAGAGGAGATGGCTCGCATGGTTGCTTGTGCTGCG 1584 

AKMNNGYDRE MARMVACAA 
1585 GCTTGTGTTCGCCATTCAGCTCGCCGCAGACCTCGCATGAGCCAGATTGTGCGTGCGTTA 1644 

ACVRMSARRRPRMSQIVRAIi 
1645 GAAGGAAATGTATCACTGTCAGATCTTAACGAAGGGATGAGACCAGGTCAAAGCAATGTA 1704 

EGN*VSLSDLNEGMRPGQSNV 
1705 TACAGCTCATACGGAGGAAGCACCGATTATGACTCGAGCCAGTACAATGAAGACATGAAG 1764 

YSSYGGSTDYDSSQYNEDMK 
1765 AAGTTTAGGAAAATGGCACTTGGAACTCAAGAGTACAACGCCACGGGTGAGTACAGTAAT 1824 

KFRKMALGTQEYN*ATGEYSN 
1825 CCGACCAGTGACTATGGACTGTACCCGTCTGGTTCAAGCAGCGAGGGCCAAACC7VCA 1884 

PTSDYGLYPSGSSSEGQTTR 
1885 G7UUVTGGAGATGGGGAAGATTAAGAGAACCGGTCAGGGTTATAGTGGACCTTCTCTTTAA 1944 

EMEMGKIKRTGQGYSGPSL* 
194 5 accagratgrgrgragragaaattgraagggtgrttttttcattatttttttaaaactsrtaaagrata 2004 
2005 tgagraaaattgccttactctaattaaaaccactacgratataaggrttataatacgttttgra 2064 
2 065 at tgcft ttt taaaaaaaaaaaaaaaaaaa 2 093 
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Proline-rich Ext e nsin-like Rec e ptor Kinases 

FIELD OF THE INVENTION 

The invention relates to nucleic acid molecules and polypeptides involved in plant 
defense, and more particularly increasing plant resistance to wounding and pathogens. 

5 BACKGROUND OF THE INVENTION 

Receptor mediated signal perception and transduction in response to external stimuli 
are essential for growth and developmental processes of multicellular organisms (Mu et al., 
1994). These extensively well characterized processes in animal systems involve receptor 
protein kinase molecules comprised of an extracellular signal perception domain, a 

1 0 hydrophobic transmembrane domain attached to an intracellular domain that possesses kinase 
activity (Horn et al.» 1994). In general, transmembrane signaling by receptor protein kinases 
requires binding of an appropriate ligand to the extracellular domain which induces receptor 
dimerization and alters the activity of the intracellular catalytic domain. This promotes 
phosphorylation of specific substrates thereby initiating a protein kinase signaling cascade 

15 (Ullrich and Schlessinger, 1990). The majority of animal receptor protein kinases isolated to 
date contain tyrosine-specific kinase domains (Ullrich and Schlessinger, 1990), however, the 
transforming growth factor beta (TGF-beta) receptor (Lin et al., 1992) and the activin receptor 
(Dijke et al., 1993) possess kinase domains with serine/threonine phosphorylation activity. 

Intracellular communication is also essential for the growth and development of 
20 higher plants. The extensive knowledge of cell surface receptor signaling in animal systems 
has resulted in the isolation of several genes predicted to encode receptor-like protein kinases 
(RLKs). The characterized members of the RLK family share highly homologous catalytic 
domains with consensus sequences indicative of serine/threonine autophosphorylation 
activity, yet the extracellular domains of these receptors are very divergent (Braun and 
25 Walker, 1996). Five different classes of plant receptor-like protein kinases have therefore 
been identified according to amino acid sequence similarity in the extracellular domains of 
these genes. The first class of receptor kinases, designated the S-domain class, have distinct 
extracellular domains homologous to the S-locus glycoprotein (SLG) (Nasrallah and 
Nasrallah, 1993). S-domain receptor kinases have several distinguishing features such as ten 
30 conserved cysteine residues located proximal to tiie transmembrane domain in addition to 

other conserved residues implicated in the proper folding of the extracellular domain (Walker, 
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1994). Among this class of receptor kinases are the S-locus recqjtor kinases (SRKs) of 
Brassica expressed exclusively in reproductive tissues and implicated along with SLGs in 
controlling the sporophytic self-incompatibility response which normally inhibits self- 
pollination (Stein et al., 1991; Goring and Rothstein, 1992). Other receptor-like kinases of 
5 this type are represented in Arabidopsis by ARKl, ARK2, ARK3 (Tobias et al., 1992; Dwyer 
et al., 1994), in maize by ZmPKl (Walker and Zhang, 1990) and by OsPKlO in rice (Zhao et 
al., 1 994). The diversity in patterns of expression among members of the S-domain class 
shows that these plant receptor kinases are involved in mediating a variety of cellular 
signaling processes (Walker, 1994). 

10 Another class of plant receptor kinases is the leucine-rich repeat (LRR) group which 

encodes proteins witii extracellular domains containing 20-25 imperfect repeats of a 24 amino 
acid leucine-rich motif involved in peptide ligand recognition, cell adhesion and imphcated in 
mediating protein-protein interactions (Braun and Walker, 1996; Wang et al., 1998). This 
class of plant receptor kinases include proteins such as CLAVATAl (Clark et al., 1997) 

15 which is involved in regulating meristem and flower development in Arabidopsis, as well as 
proteins fimctioning in gamete development such as PRKl of Petunia (Mu et al., 1994). The 
LRR class is represented in Arabidopsis by other receptor kinases such as ERECT A (Torii et 
al., 1996) which has been shown to be essential for proper plant and organ elongation, BRIl, 
a receptor involved in brassinosteroid signal transduction (Li and Chory, 1997), as well as 

20 TMKl (Chang et al., 1992) and RLK5 (Walker, 1993) which may have more general roles in 
cellular signaling as suggested by their ubiquitous expression patterns in a variety of 
vegetative and reproductive tissues. Xa21, another member of the LRR class, has been 
implicated in pathogen recognition by providing resistance in rice to Xanthomonus oryzae pv. 
oryzae (Song et al., 1995). 

25 The lectin-like class of plant receptor kinases is represented only in Arabidopsis 

thaliana by Ath.lecRKl (Herve et al., 1996) and LRKl (Swamp et al., 1996). The 
extracellular domain of these receptor kinases share sequence similarity with lectins which 
are known carbohydrate binding proteins and implicated in the transduction of 
oligosaccharide signals in plant cellular communication processes (Herve et al., 1996). 

30 The two remaining classes of plant receptor kinases isolated in Arabidopsis thaliana 

include proteins with extracellular domains containing epidermal growth factor (EGF)-like 
motifs found in many proteins involved in extracellular interactions (WAKl; Kohom et al.. 
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1992) as well as thaumatin-like domains homologous to PR5 proteins involved in 
pathogenesis (PR5K; Wang et al., 1996). 

Plants remain very vulnerable to wounding and pathogens despite these advances. 
There is a need to identify other polypeptides, that help to protect plants. There is also a need 
5 for transgenic plants which overexpress these polypeptides and which have increased 
resistance to wounding and pathogens. 

SUMMARY OF THE INVENTION 

The invention relates to nucleic acid molecules and polypeptides involved in plant 
defense, and more particularly increasing plant resistance to wounding and pathogens. 

10 We isolated a cDNA clone designated PERKl (Proline-rich Extensin-like Receptor 

Kinase 1) which encodes a receptor kinase in Brassica napus. We define a new class of plant 
receptor kinases characterized by an extracellular domain rich in proline sharing sequence 
similarity to the ext^isin family of cell wall proteins. PERKl is induced by both wounding 
and chemical elicitors which mimick a pathogen attack, showing a role for PERKl in 

15 mediating a plant's defense response to mechanical and biological attack. Similar PERK 
nucleic acid molecules and polypeptides are foimd in other plants and cells. 

The invention relates to an isolated nucleic acid molecule encoding a proline-rich, 
extensin-like receptor kinase (PERK) polypeptide, or a jS-agment of a PERK polypeptide 
having PERK activity. The molecule is a signaling molecule associated with the cell wall via 

20 its extensin-like extracellular domain and is involved in the transduction of extracellular 

stimuli into an intracellular response through a cytoplasmic kinase domain, thereby bridging 
the cell wall-plasma membrane continuum. The extracellular stimuli includes wounding or 
pathogen attack. The wounding can include a cut, a break, a tear, a fold or an insect wound. 
Typical pathogens include bacterial pathogens, fungal pathogens, Sclerotinia sclerotiorum, 

25 Cylindrosporium concertricum, Phoma lingam, Pseudomonas syringae, Streptomyces 

scabies. Blackleg, Whiterust, Fusarium Head Bli^t, Rust, Bunt, Leaf Spot , White mold, 
root rot or Fusarium ear rot 

The invention also includes an isolated nucleic acid molecule encoding a PERK 
polypeptide, a fragment of a PERK polypeptide having PERK activity, or a polypeptide 
30 having PERK activity, comprising a nucleic acid molecule selected from the group consisting 
of: 
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(a) a nucleic acid molecule that hybridizes to a nucleic acid molecule consisting of [SEQ 
ID NO: 1 ], or a complement thereof under low, moderate or high stringency 
hybridization conditions wherein the nucleic acid molecule encodes a PERK 
polypeptide or a polypeptide having PERK activity; 

5 (b) a nucleic acid molecule degenerate with respect to (a), wherein the nucleic 

molecule encodes a PERK polypeptide or a polypeptide having PERK activity. 



The hybridization conditions preferably include low stringency conditions of IXSSC, 
0.1%SDSat50**C or high stringency conditions of 0.1 XSSC, 0.1% SDS at eS'^C. The 
10 invention also includes an isolated nucleic acid molecule encoding a PERK polypeptide, a 
fragment of a PERK polypeptide having PERK activity, or a polypeptide having PERK 
activity, comprising a nucleic acid molecule selected from the group consisting of: 

(a) the nucleic acid molecule of the coding strand shown in [SEQ ID NO: 1 ], or a 
complement thereof; 

1 5 (b) a nucleic acid molecule encoding the same amino acid sequence as a 

nucleotide sequence of (a); and 

(c) a nucleic acid molecule having at least 1 7% identity with the nucleotide 
sequence of (a) and which encodes a PERK polypeptide or a poypeptide having PERK 
activity. 

20 The invention also relates to an isolated nucleic acid molecule comprising a sequence 

that encodes a polypeptide having the sequence of SEQ ID NO:2, or the sequence of SEQ ID 
NO:2 with conservative amino acid substitutions. 

The PERK polypeptide preferably consists of or comprises a PERKl polypeptide. 

The nucleic acid molecule preferably comprising all or part of a nucleotide sequence 
25 shown in [SEQ ID NO:l or 3] or a complement thereof. The nucleic acid may consist of the 
nucleotide sequence shown in [SEQ ID NO:l or 3] or a complement thereof. The invention 
also includes a PERKl nucleic acid molecule isolated from Brassica^ or a fragment thereof. 
The Brassica may include Brassica napus, Brassica juncea, Brassica rapa or Brassica 
oleracea. 
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Another embodiment of the invention includes a recombinant nucleic acid molecule 
comprising a nucleic acid molecule of the invention and a constitutive promoter sequence or 
an inducible promoter sequence, operatively linked so that the promoter enhances 
transcription of the nucleic acid molecule in a host cell. 

5 The nucleic acid molecule of the invention optionally includes genomic DNA, cDNA 

or RNA . The nucleic acid molecule is optionally chemically synthesized. 

The invention also includes an isolated nucleic acid molecule comprising a nucleic 
acid molecule selected firom the group consisting of 8 to 10 nucleotides of the nucleic acid 
molecule of claim 6, 1 1 to 25 nucleotides of the nucleic acid molecule of claim 6 and 26 to SO 

1 0 nucleotides of the nucleic acid molecule of claim 10. The nucleic acid molecule of the 

invention comprising at least 10, 15, 20, 30, 50 or 100 consecutive nucleotides of [SEQ ID 
NO:l or 3] or a complement thereof. These sequences are useful as hybridization probes to 
detect PERKl in a sample. They are also preferably useful as antisense oligonucleotides to 
inhibit gene expression. The invention also includes a nucleic acid molecule probe encoding 

15 all or part (at least 10, 15, 20, 30, 50 or 100 amino acids) of PERKl polypeptide. 

Another aspect of the invention includes a vector comprising a nucleic acid molecule 
of the invention. The vector optionally comprises a promoter selected from the group 
consisting of a super promoter, a 35S promoter of cauliflower mosaic virus, a chemical 
inducible promoter, a copper-inducible promoter, a steroid-inducible promoter and a tissue- 
20 specific promoter. 

Another variation of the invention includes a host cell comprising the recombinant 
nucleic acid molecule or the vector of the invention or progeny of the host cell. The host cell 
of the invention is optionally selected from the group consisting of a frmgal cell, a yeast cell, a 
bacterial cell, a microorganism cell and a plant cell. The invention includes a method of 
25 producing polypeptide, comprising culturing a host cell of the invention under conditions 
pemiitting expression of the polypeptide. The method preferably fiirther includes isolating 
the protein from the cell or the cell medium. 

The invention also includes a plant, a plant part, a seed, a plant cell or progeny thereof 
comprising a recombinant nucleic acid molecule or vector of the invention. The plant part 
30 optionally comprises all or part of a leaf, a flower, a stem, a root or a tuber. The plant, plant 
part, seed or plant cell is optionally selected from a species from the group consisting of 
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Brassica napus, Brassica rapa, Brassica juncea, Brassica oleracea^ or from the family 
Brassicaecae, Arabidopsis, potato, tomato, tobacco, cotton, carrot, petunia, sunflower, 
strawberries, spinach, lettuce, rice, soybean, com, wheat, rye, barley, sorgum and alfalfa. The 
plant may comprise a dicot plant or a monocot plant. The invention also includes an isolated 
5 polypeptide encoded by and/or produced from a nucleic acid molecule or tiie vector of the 
invention. The polypqitide is preferably an isolated PERK polypeptide or a fragment thereof 
having PERK activity. The invention also includes an isolated polypeptide, the amino acid 
sequence of which comprises at least ten consecutive residues of [SEQ ID NO:2]. The 
invention also includes an isolated immunogenic polypeptide, the amino acid sequence of 

1 0 which comprises at least ten consecutive residues of [SEQ ID NO:2 ]. The invention also 
includes an isolated polypeptide, the amino acid sequence of which comprises residues 1 to 
137, 138 to 160 and 161 to 648 of [SEQ ID NO:2]. The polypeptide preferably comprises all 
or part of an amino acid sequence in [SEQ ID NO:2]. The invention also includes a 
polypeptide comprising or consisting of the sequence of SEQ ID NO:2 or 4, or the sequence 

15 of SEQ ID NO:2 or 4 with conservative amino acid substitutions. A variation of the 

invention comprises a polypeptide fragment of the PERK polypeptide, or a peptide mimetic 
of the PERK polypeptide (eg. PERKl). The polypeptide fragment of the invention preferably 
comprises at least 20 amino acids, which fragment has PERK activity. The fragment or 
peptide mimetic is preferably capable of being bound by an antibody to a polypeptide of tiie 

20 invention, such as PERKl . The polypeptide is optionally recombinantly produced. The 

invention includes an isolated and purified polypeptide comprising the amino acid sequence 
of a PERK polypeptide, wherein the polypeptide is encoded by a nucleic acid molecule that 
hybridizes imder moderate or stringent conditions to a nucleic acid molecule in [SEQ ID 
NO:l or 3], a degenerate form thereof or a complement. The invention also includes a 

25 polypeptide comprising an amino acid sequence having greater than 20% sequence identity to 
the polypeptide of [SEQ ID NO:l or 3], The polypeptide preferably comprises a PERK 
polypeptide. The polypeptide is preferably isolated from Brassica^ for example, Brassica 
napus or Brassica juncea or Brassica rapa or Brassica oleracea. 

The polypeptide preferably includes a kinase domain including at least 30% homology 
30 to the kinase domain of [SEQ ID NO.:2] and/or an extracellular domain including at least 
20% homology to the extracellular domain of [SEQ ID NO.:2]. The invention includes an 
isolated nucleic acid molecule encoding a polypeptide the invention eg. [SEQ ID NO.:2]. 
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The invention also includes an antibody that binds specifically to a polypeptide of the 
invention, in particular [SEQ ID NO-:2]. The antibody optionally comprises a monoclonal 
antibody or a polyclonal antibody. The invention also includes a purified polypeptide that 
binds specifically to an antibody that binds specifically to PERKl . Such proteins are usefiil, 
5 for example^ as a positive control in an assay utilizing the antibody. 

The invention also includes an isolated nucleic acid molecule encoding a polypeptide 
that reduces the severity of wounding or pathogen attack in a plant, the polypeptide 
comprising: 

(a) an extracellular domain which recognizes an extracellidar binding molecule whose 

1 0 level is increased during the wounding or pathogen attack, the extracellular domain encoding 
a plurality of repeats selected firom the group consisting of SPPPP, SPP, PP and PPP, wherein 
a plurality of the proline molecules are capable being glycosylated and/or hydroxylated; 

(b) a membrane domain operably connected to the extracellular domain, wherein the 
membrane domain is capable of extending across a cell membrane fi-om the extracellular side 

15 of the membrane to intracellular side of the membrane; and 

(c) a cytoplasmic domain operably connected to the membrane domain, wherein 
the cytoplasmic domain comprises a means for producing kinase activity when the 
extracellular binding molecule interacts with the extracellular domain. 

The term "isolated nucleic acid" refers to a nucleic acid the structure of which is not 
20 identical to that of any naturally occurring nucleic acid or to that of any fi-agment of a 

naturally occurring genomic nucleic acid spanning more than three separate genes. The terai 
therefore covers, for example, (a) DNA which has the sequence of part of a naturally 
occurring genomic DNA molecules; (b) a nucleic acid incorporated into a vector or into the 
genomic DNA of a prokaryote or eukaryote, respectively, in a manner such that the resulting 
25 molecule is not identical to any naturally occurring vector or genomic DNA; (c) a separate 
molecule such as cDNA, a genomic firagment, a fragment produced by reverse transcription of 
polyA RNA which can be amplified by PCR, or a restriction fi:^gment; and (c) a 
recombinant nucleotide sequence that is part of a hybrid gene, i.e., a gene encoding a fiision 
protein. Specifically excluded from this definition are nucleic acids present in mixtures of (i) 
30 DNA molecules, (ii) transfected cells, and (iii) cell clones, e.g., as these occur in a DNA 
library such as a cDNA or genomic DNA library. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

Preferred embodiments will be described in relation to the drawings in which: 
Figure 1. Nucleotide and Deduced Amino Sequence. [SEQ ID NOS.: 1 & 2] 

(A) Figure 1 shows the nucleic acid molecule of [SEQ ID NO.: 1] and the amino acid 
5 sequence of [SEQ ID NO.: 2]. 

In a preferred embodiment, tiie figure shows the nucleotide sequence corresponding to the 
PERKl cDNA, with the predicted amino acid sequence presented as a single-letter code 
below the nucleotide sequence. Numbers to the left refer to nucleotide sequence and the 5* 
and 3' untranslated regions are presented in lower case letters. Potential N-glycosylation site 

1 0 (Asn-x-Ser/Thre) are indicated by dots above the Asn residues, and the extensin signature 

(Ser-Pro)4 pentapeptide motif present in the extracellular domain is denoted in boldface type. 
The predicted membrane spanning region is marked by a solid imderline. As defined by 
Hanks and Quinn (1991), the catalytic domain has been subdivided into 1 1 subdomains 
marked by dashed underlines and superscript roman numerals. The amino acids with a 

1 5 bracket underneath represent residues that are absolutely conserved, whereas amino acids 
with a wavy line underneath represent groups of conserved amino acids. The two regions 
marked by double underlines represent consensus sequences common amongst 
serine/threonine kinases. 

(B) Stmctural features of the PERKl polypeptide. A Kyte hydropathy plot (Kyte and 
20 Doolittle, 1 982) of the amino acid sequence generated by DNAsis® software (Hitachi 

Software, San Bruno, CA) is shown, where increased hydrophobicity is denoted by positive 
values. The domains of the PERKl protein are illustrated below. ECD, extracellular domain; 
TM, transmembrane domain. 

(C) Shows the nucleic acid molecule of [SEQ ID NO.: 3] and the amino acid sequence of 
25 [SEQ ID NO.: 4]. These sequences are identical to nucleotides 1 to 1944 of [SEQ ID NO:l] 

and the corresponding amino acid sequence but they also include 5' and 3' untranslated 
nucleotide regions. (D) Shows the nucleic acid molecule of [SEQ ID NO.: 3] including the 5' 
and 3' untranslated regions. The start and stop codons are underlined and in light print. The 
nucleotide numbers correspond to the entire sequence and not only to the coding region. In a 
30 preferred embodiment, this is the coding sequence of PERKl . 
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(E) Figure 1 shows the amino acid sequence of [SEQ ID NO.: 5] including amino acids 
corresponding to the 5' and 3' untranslated regions. The starting methionine is in Ught print 
and the stop codon is indicated by an asterisk. It will be clear to those skilled in the art that 
one may only use the portion of the amino acid sequence between the start and stop codons. 
5 Figure 2. Genomic DNA Southern Blot Analysis of PERKl . 

Genomic DNA (5 micrograms) isolated from Brassica napus leaf tissue was digested with the 
indicated restriction enzymes, blotted and hybridized with a partial 1.5 kb PERKl cDNA 
probe under varying conditions of stringency, DNA markers are indicated in kilobases. 

(A) Genomic DNA gel blot analysis under low stringency conditions. 

1 0 (B) Genomic DNA gel blot analysis under high stringency conditions. 

Figure 3. Expression of PERKl cDNA 

(A) RNA gel blot analysis of PERKl transcripts from total RNA extracted 
fix>m various Brassica napus tissues. The blot was hybridized with the full length 
PERKl cDNA probe and the expected --2.2kb PERKl transcript was detected (upper panel). 
15 (B) The blot was subsequently probed with 1 8S rRNA as an internal 
control for even loading (lower panel). 

Figure 4. Wound-Inducible Accumulation of PERKl mRNA in Brassica napus Leaf and 
Stem Tissue. 

(A) Fully expanded leaves were wounded by punching out discs around the perimeter of the 
20 leaf blade. Wounds mimic injury inflicted on plants in the field as a resuh of insect attack or 

other mechanical damage. Total RNA was extracted at various time intervals after treatment, 
subjected to Northem blot analysis and probed with full length PERKl cDNA (bold-face 
arrow). The blot was reprobed with cyclophilin used as an internal control for even loading 
(open-face arrow). The graph represents the expression profile of PERKl in response to 
25 wounding corrected against levels of cyclophilin expression. Error bars represent the standard 
error derived from two independent experiments. Control unwounded leaf tissue represented 
by 0 hr time point. 

(B) Northem blot showing a time-course induction of PERKl mRNA accumulation in 
wounded stem tissue. Total RNA harvested at the indicated time points was blotted and 

30 hybridized against the fiill length PERKl coding sequence (bold-face arrow). The cyclophilin 
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loading control (open-face arrow) was used to normalize levels of PERKl mRNA 
accumulation represented graphically. Error bars represent the standard error derived from 
two independent experiments. Control unwounded stem tissue represented by 0 hr time point. 

Figure 4a. Wound-Inducible Accumulation of PERKl mRNA in Brassica napus Leaf Disc 
5 Tissue. 

The effects of a more localized wound defense response on the levels of PERKl mRNA 
accimiulation was investigated using wounded leaf discs. Total RNA was extracted from leaf 
discs at various time intervals, subjected the Northem blot analysis and probed with fee full 
lengfe PERKl cDNA (bold-face arrow). The blot was reprobed wife cyclophilin as an 
1 0 internal control for even loading (open-face arrow ). The graph represents fee steady state 
levels of PERKl mRNA in response to woimding corrected against levels of cyclophilin 
expression. Control unwounded leaf tissue is represented by fee Ohr time point. 

Figure 4b. Woimd-Inducible Accumulation of PERKl mRNA in Brassica napus Leaf and 
Stem Tissue. 

1 5 (A) Fully expanded leaves were wounded by rubbing fee undersides wife abrasive sand 
paper. Total RNA was extracted from fee leaf at various time intervals after treatment, 
subjected fee Norfeem blot analysis and probed wife fee foil lengfe PERKl cDNA (bold- 
face arrow). The blot was reprobed wife cyclophiHn as an intemal control for even loading 
(open-face arrow). The graph represents fee steady state levels of PERKl mRNA in response 

20 to woimding corrected against levels of cyclophilin expression. Control unwounded leaf 
tissue is represented by fee Ohr time point. 

(B) Norfeem blot analysis showing a time-course induction of PERKl mRNA accumulation 
in stem woimded by robbing wife abrasive sand paper. Total RNA harvested at fee indicated 
time points was blotted and hybridized against fee foil lengfe PERKl cDNA (bold-face 
25 arrow). The cyclophilin loading control (open-face arrow) was used to normalize levels of 
PERKl mRN accimiulation represented graphically. Control unwounded stem tissue 
represented by Ohr time point. 

Figure 4c. Wound-Inducible Accumulation of PERKl mRNA in Brassica napus Root 
Tissue, 

30 Root tissue from hydroponically grown B. napus plants was used to investigate whefeer levels 
of PERKl mRNA increase in response to a wounding stimulus. Roots were wounded by 
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slicing tissue into 3cm sections and incubating on filter paper moistened with 20mM 
phosphate buffer supplemented with chloramphenicol. Total RNA extracted at various time 
intervals after treatment, was subjected the Northem blot analysis and probed with the fiill 
length PERKl cDNA (bold-face triangle). The blot was reprobed with cyclophilin as an 
5 internal control for even loading (open-face triangle). The graph represents the expression 
profile of PERKl in response to woimding corrected against levels of cyclophilin expression. 
Control unwounded root tissue is represented by Ohr time point 

Figure 5. Effects of 50 micromolar Methyl Jasmonate (MeJA) on PERKl mRNA 
Accumulation in Treated Brassica napus Leaf and Stem Tissue. 

1 0 (A) Brassica napus plants were tiioroughly sprayed with a 50 micromolar MeJA solution, 
and leaf tissue subsequently harvested at different time intervals after treatment. Total RNA 
prepared firom treated leaf tissue was subjected to Northem blot analysis and probed with full 
length PERKl cDNA (open-face triangle). Control plant (0 hr) was treated with the carrying 
solution minus the chemical inducer (0.1% [7v] ethanol for MeJA). The blot was reprobed 

1 5 with cyclophilin used as an internal control for even loading (bold-face triangle). The graph 
represents a corrected profile for the levels of PERKl mRNA accumulation in response to 
treatment with MeJA normalized against levels of cyclophilin expression. 

(B) Northem blot showing a time-course induction of PERKl mRNA accumulation in 
MeJA treated stem tissue. Total RNA harvested at the indicated time points was blotted and 
20 hybridized against the full length PERKl coding sequence (open-face triangle). The 

cyclophilin loading control (bold-face triangle) was used to normalize levels of PERKl 
mRNA accumulation represented graphically. 

Figure 6, Effects of 4mM Salicylic Acid (SA) on PERKl mRNA Accumulation in Treated 
Brassica napus Leaf and Stem Tissue. 

25 (A) Brassica napus plants were thoroughly sprayed with a 4mM SA solution, and leaf tissue 
subsequently harvested at different time intervals after treatment. Total RNA prepared firom 
treated leaf tissue was subjected to Northem blot analysis and probed with full length PERKl 
cDNA (open-face triangle). Control plant (0 hr) was treated with the carrying solution minus 
the chemical inducer (5mM phosphate buffer, pH7). The blot was reprobed with cyclophilin 

30 used as an internal control for even loading (open-face triangle). The graph represents a 
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corrected profile for the levels of PERKl mRNA accumulation in response to treatment with 
SA normalized against levels of cyclophilin expression. 

(B) Northern blot showing a time-course induction of PERKl mRNA accumulation in SA 
treated stem tissue. Total RNA harvested at the indicated time points was blotted and 
5 hybridized against the full length PERKl coding sequence (open-face triangle). The 
cyclophilin loading control (bold-face triangle) was used to normalize levels of PERKl 
mRNA accumulation represented graphically. 

Figure 7. B. napus leaf tissue was used to investigate whether levels of PERKl mRNA 
increase in response to treatment with a fungal pathogen Sclerotinia sclerotiorum. Brassica 

1 0 napus leaves were excised from the plants placed in orchid tubes with water and incubated in 
closed aluminum trays at room temperature under fluorescent light to generate a humid 
enviroimient. The leaves were inoculated with fungal agar plugs and incubated for the 
indicated time points, after which the tissue was harvested and immediately frozen in liquid 
nitrogen for fiuther analysis. Control time points were inoculated with agar plugs only. Total 

15 RNA was extracted and Northern blot analysis was subsequently performed. Blots were 
probed with the full length PERKl cDNA and then with cyclophilin as a control for even 
loading. Throughout the experiment, it was noted that the fungal agar plugs began to adhere 
well to the tissue 7hrs following inoculation and by 15hrs post-inocluation macroscopic 
lesions were apparent on the surface of the leaves. The corrected profile for PERKl mRNA 

20 accumulation in response to this fimgal pathogen treatment shows that there was a 2.5 fold 
induction in the levels of PERKl mRNA lOhrs following treatment. 

Figure 8. Proposed pathway mediating PERKl expression in response to wounding, MeJA 
and S A treatments. 

Figure 9. Analysis of Kinase Activity and Phosphoamino Acid Analysis of Recombinant 
25 PERKl Protein. (A) Purification of recombinant mutant (K-E) and wild type catalytic domain 
fusion proteins. Extracts of E. coli cells harboring the fusion proteins were purified on MBP 
amylose resin. Proteins were visualized by Coomassie blue staining. The mutant CD fusion 
protein is present in lane I and the wild type CD fusion protein is represented in lane 2. 
(B) Western blot analysis of recombinant fusion proteins. Western blot analysis using an 
30 anti-MBP antibody was performed on purified CdmutK-E and CDwt to confirm the identity of 
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these proteins. Lane 1 represents detection of the mutant fusion protein and the signal 
obtained in lane 2 verifies the identity of the wild type fusion protein. 

(C) Autoradiogram of autophosphorylation assay of recombinant PERKl CDwt and 
CdmutK-E fusion proteins. Detection of a signal in lane 1 proves that the wild type catalytic 

5 domain of PERKl is capable of autophosphorylation. Absence of a signal for the CdmutK-E 
in lane 2 confirms that the result obtained for CDwt is specific to the kinase activity of 
PERKl and not due to bacterial kinase contamination. 

(D) Phosphoamino acid analysis of autophosphorylated PERK, y ^^P-labeled CDwt was 
hydrolyzed with HCl and subjected to two-dimensional electrophoresis. Presence of 

10 radiolabeled phospho-serine (pS) and phospho-threonine (pT) confirms that PERKl encodes 
an active protein kinase with serine/threonine substrate specificity. 

Figure 10. Shows sequence identity of PERKl to polypeptides from the Arabidopsis genome 
sequencing project. 

Figure 11. Shows the nucleic acid molecule of [SEQ ID NO.: 6] and the amino acid 
15 sequence of [SEQ ID NO.: 7]. 

In a preferred embodiment, the figure shows the sequence of the predicted Arabidopsis gene - 
Accession number AAC98010 

A) Genomic Sequence. The predicted open reading frame is underlined. The start codon 
(ATG) and stop codon (TGA) are double underlined. 
20 B) Translation of the predicted open reading fi-ame. The transmembrane domain is 
underlined. 

Figure 12. Shows the nucleic acid molecule of [SEQ ID NO.: 8] and the amino acid 
sequence of [SEQ ID NO.: 9]. 

In a preferred embodiment, the figure shows the sequence of the predicted Arabidopsis gene - 
25 Accession number AAD 1 549 1 

A) Genomic Sequence. The predicted open reading frame is underlined. The start codon 
(ATG) and stop codon (TGA) are double underlined. 

B) Translation of the predicted open reading fi-ame. The transmembrane domain is 
underlined. 

30 Figure 13. Shows the nucleic acid molecule of [SEQ ID NO.: 10] and the amino acid 
sequence of [SEQ ID NO.: 11]. 
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In a preferred embodiment, the figure shows the sequence of the predicted Arabidopsis gene - 
Accession number CAA18823. 

A) Genomic Sequence. The predicted open reading frame is underlined. The start codon 
(ATG) and stop codon (TGA) are double underlined. 

5 B) Translation of the predicted open reading frame. The transmembrane domain is 
underlined 

Figure 14. Shows the nucleic acid molecule of [SEQ ID NO.: 12] and the amino acid 
sequence of [SEQ ID NO.: 13]. 

In a preferred embodiment, the figure shows the sequence of the predicted Arabidopsis gene - 
1 0 Accession number CAAl 8590 

A) Genomic Sequence. The predicted open reading fi^me is underlined. The start codon 
(ATG) and stop codon (TGA) are double underlined. 

B) Translation of the predicted open reading fi-ame. The transmembrane domain is 
imderlined. 

15 DETAILED DESCRIPTION OF THE INVENTION 

hi this study, we report the isolation and i>reliminary characterization of PERKl 
nucleic acid molecules and polypeptides, and in particular PERKl cDNA which encodes a 
novel receptor-like protein kinase in Brassica napus. PERK polypeptides represent a novel 
class of receptor kinases in high^ plants. 

20 Protein kinases play important roles in plant defense (Zhou et al., 1995; Usami et al., 

1995; Suzuki and Shinhi, 1995). Significant homology is shared between the extracellular 
domain of PERKl and both extensin and proline rich proteins. PERKl mediates plant 
responses to mechanical wounding (for example insect attack) and pathogen attack. PERK 
polypeptides preferably include a catalytic domain. PERK polypeptides are also preferably 

25 signaling molecules associated with the cell wall via their extensin-like extracellular domain 
and involved in the transduction of extracellular stimuli (e.g. wounding, pathogen attack) 
into an intracellular response through a transmembrane domain and a cytoplasmic kinase 
domain, thereby bridging the cell wall - plasma membrane continuum. 

In general, plants challenged by mechanical wounding or pathogen attack induce rapid 
30 expression of genes (i.e. proteinase inhibitor (pin) and pathogenesis related (PR) genes 
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respectively) that are expressed locally as well as systemically in unaffected parts of the plant 
(Yang et al., 1997). Increased levels of extensin transcripts as a result of mechanical 
wounding have been well established in many other systems (Sauer et al., 1990; Shirsat et al., 
1996). For example, in Brassica napus leaf and stem tissue, wound induction of PERKl 
5 mRNA accumulation is a very rapid response detectable within 15 min following injury 

(Figure 4, 4a-b). Increased levels of PERKl mRNA were also detected in wounded root tissue 
within 5 min following treatment (Figure 4c). MeJA (the methyl ester of the plant growth 
regulator jasmonic acid (JA)) is involved in the signal transduction pathway regulating gene 
activation upon wounding. Steady state levels of PERKl mRNA remain unaffected by 

1 0 exogenously apphed MeJA (Figure 5) which shows that the inducibility of PERKl by 
wounding occurs via a MeJA-independent pathway (Figure 7). Studies conducted by 
Titarenko et al. (1997) addressing the role of JA in mediating wound responses support the 
existence of multiple distinct wound signal transduction pathways. Exogenously applied JA 
was able to induce only a subset of wound responsive genes in Arabidopsis which ultimately 

1 5 resulted in a stronger systemic accumulation in wounded plants. Conversely, a second set of 
wound responsive genes showing a stronger induction locally in wounded tissue showed no 
substantial accumulation upon JA treatment. In conjunction with the pattern of PERKl 
mRNA accumulation in response to wounding and MeJA, it appears that plants respond to 
wounding by two distinct wound signal transduction pathways: one which does not require 

20 JA and is primarily responsible for gene activation in the vicinity of the wound site and the 
other which involves JA perception and activates gene expression both locally and 
systemically to the wound site (Titarenko et al., 1997), 

Many of the inducible defense responses are not exclusive to mechanical wounding 
but are also initiated by pathogen attack. The similarity between responses to wounding and 

25 pathogen attack are not surprising since mechanical damage often precedes pathogen 

infection and conversely, mechanical damage may often result fiom a pathogen or insect 
attack (Truemit et al., 1996). SalicyKc acid has been impUcated in having an important role 
in the signal transduction pathway leading to systemic acquired resistance (SAR) (Penninckx 
et al., 1996). Steady state levels of PERKl mRNA also accumulated in B. napus leaf and 

30 stem tissue upon exogenous appUcation of 4mM SA (Figure 6). Collectively, the profiles of 
PERKl mRNA accumulation in response to wounding, MeJA and SA are not entirely 
surprising. PERKl induction is rapid in response to wounding (Figure 4, 4a-c) and the lack 
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of PERKl transcript accumulation in response to MeJA (Figure 5) shows a pathway for 
wound mediated induction of PERKl that is independent of MeJA (Figure 7). The 
pronounced and rapid induction of PERKl in response to exogenous SA (Figure 6) supports 
other studies showing that SA is known to inhibit woimd responsive genes that are regulated 
5 by a MeJA-dependent pathway (Pena-Cortes et aL, 1993; Doares et aL, 1995). Therefore, it is 
unlikely that both MeJA and SA would induce PERKl mRNA accumulation given that these 
pathways are known to be antagonistic (Peiia-Cortes et al., 1993). Nevertheless, the rapid 
induction of PERKl during these treatments shows a role early on in a plant's defense 
signaling pathway. 

1 0 Characterization of PERKl 

Genomic Southern blot analysis under low and high stringency conditions revealed 
that PERKl is a single copy gene in the Brassica genome (Figure 2). PERKl is ubiquitously 
expressed at high levels in stem, petal and pistil tissue and is less abundant in root, leaf and 
anther tissues ( upper panel). 

15 The deduced amino acid sequence of PERKl shows that it is a transmembrane 

receptor kinase with a distinct extracellular, transmembrane and cytoplasmic domain (Figure 
1). The extracellular domain of PERKl shows sequence similarity to plant cell wall proline- 
rich proteins and extensins which comprise a family of hydroxyproline-rich glycoproteins 
(HRGPs). Extensins are particularly abimdant proteins in plant cell walls and are very rich in 

20 proline and serine as well as in combinations of valine, tyrosine, I>^ine and/or histidine 
residues. The distinctive characteristic of dicot extensins is their repetitive (Ser-Pro)4 
pentapeptide blocks. Although extensins are synthesized as soluble precursors, the majority 
of proline residues are hydroxylated and both the hydroxylated proline as well as the serine 
residues of these proteins are glycosylated by post-translational modifications (Cassab, 1998). 

25 When secreted to the plant cell wall, extensins become rapidly insoluble, presumably due to 
the formation of covalent isodityrosine bridges (Cassab, 1998). Although extensins have 
been proposed to be structural cell wall proteins and important in development, they have also 
been directly implicated in plant defense against mechanical wounding (Shirsat et al., 1996) 
and pathogen attack (Corbin et aL, 1987; Showalter, 1993). The catalytic domain of PERKl 

30 possesses all of the invariant residues necessary for kinase activity and sequence similarity in 
subdomains VI and Vm to amino acid consensus sequences characteristic of serine/threonine 
kinases shows a role for PERKl in plant signal transduction (Hanks and Quinn, 1991). Given 
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the similarity of PERKl in the extracellular domain to the extensin family of cell wall 
proteins, PERKl can detect changes to the cell wall through mechanical damage or pathogen 
attack and then pass the signal onto the celL The cell can then respond to the attack with its 
defence mechanisms. 

5 PERK nucleic acid molecules and proteins also have sequence identity and similarity 

to proline rich proteins as well. About 40% of PERKl 's extracellular domain is comprised of 
proline. Below is a list of the Arabidopsis clones and their respective proUne composition 
(Table below). 

Table 1 

10 % Prolines in Extracellular Domains 



Gene 


% Proline in ECD 


PERKl 


56/137 = 41% 


CA18590 


105/279 = 38% 


AAC98010 


85/246 = 34% 


CAA18823 


51/179 = 28% 


AAD15491 


36/149 = 24% 



It is possible to use all or part of a proline rich domain fix>m an extensin or a proline rich 
protein (or similar regions) to r^lace all or part of PERKl 's extracellular domain. 

In summary, PERKl is a unique plant protein in Brassica napus involved in wound 
15 and pathogen response which physically links the cell wall and plasma membrane. PERKl is 
involved in the general perception and subsequent transduction of a wound and/or pathogen 
stimulus, ultimately triggering a plant's defense mechanisms and conferring broad protection 
against such stimuli. Preliminary characterization showed that levels of PERKl mRNA 
accimiulate rapidly in response to wounding and S A. Further characterization of PERKl 
20 induction with respect to changes in levels of phosphorylation provides additional evidence 
for the unequivocal role of PERKl in plant defense. Furthermore, transgenic analysis of 
plants expressing altered levels of PERKl confirms the involvement of PERKl in wound and 
pathogen signaling. PERKl polypeptides and nucleic acid molecules may be isolated from 
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the Brassicaceae including Arabidopsis, Brassica napus, Brassica rapa, Brassica juncea, 
Brassica oleracea, and other plants such as potato, tomato, tobacco, cotton, carrot, petunia, 
sunflower, strawberries, spinach, lettuce, rice, soybean, com, wheat, rye, barley, sorgum and 
alfalfa. 

5 PERK Nucleic Acid Molecules and Polypeptides 

The invention relates to PERK nucleic acid molecules and polyp^tides which 
increase wounding resistance and pathogen resistance in cells and plants. These polypeptides 
preferably include an extracellular domain, a transmembrane domain and a cytoplasmic 
domain. The cytoplasmic domain preferably includes a region with kinase activity. The 
1 0 kinase activity is involved in cellular signaling. The PERK nucleic acid molecules which 

encode PERK polypeptides are particularly useful for producing transgenic plants which have 
increased wounding and pathogen resistance compared to a wild type plant. 

It will also be apparent that there are polypeptide and nucleic acid molecules from 
other organisms, such as those listed previously, that are similar to PERK polypeptides and 

15 nucleic acid molecules. PERK polypeptides are useful in increasing wounding and pathogen 
resistance in a cell, preferably a plant cell, because they include extensin-like and proline-rich 
domains (this refers to a plurality of domains including multiple proline residues, which are 
preferably similar to those found in extensins), such as SPPPP, SPP and PPP which are 
capable of being hydroxylated in response to wounding or pathogen attack. Once 

20 hydroxylated, extensins become r^idly insoluble which strengthens the cell wall and in 
response to pathogen attack helps agglutinate or prevent the spread of the pathogen to 
neighbouring plant cells. 

The PERK nucleic acid molecules and polypeptides, as well as their role in plants 
were not known before this invention. The ability of these compounds to increase wounding 
25 and pathogen resistance of transgenic host cells (particularly plant cells) and transgenic plants 
compared to wild type cells and plants was unknown. 

All nucleotides and polypeptides which are suitable for use in the methods of the 
invention, such as the preparation of transgenic host cells or transgenic plants, are included 
within the scope of the invention. Genomic clones or cDNA clones are preferred for 
30 preparation of transgenic cells and plants. 
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In a preferred embodiment, the invention relates to a cDNA encoding PERK 
polypeptides from Brassica napus. The cDNA sequence and the corresponding amino acid 
sequence for PERKl is presented in Figure 1 . The invention also includes splice variants of 
the nucleic acid molecules as well as polypeptides produced from the molecules. 

5 Characterization of Nucleic Acid Molecules and Polypeptides 

In one variation, the invention includes DNA sequences (and the corresponding 
polypeptide) including at least one of the sequences shown in figure 1 in a nucleic acid 
molecule of preferably about : less than 1000 base pairs, less than 1250 base pairs, less than 
1500 base pairs, less than 1750 base pairs, less than 2000 base pairs, less than 2250 base 
10 pairs, less than 2500 base pairs, less than 2750 base pairs or less than 3000 base pairs. 

The coding region of the PERKl nucleic acid molecule is as follows: 



Table 2 



Nucleic Acid Molecule 


Start Nucleotide 
[brackets show 
corresponding amino 
acid nos.] 


End Nucleotide 
[brackets show 
corresponding 
amino acid nos.] 


PERKl (coding region 
only) 


1(1) 


1944 (648) 


PERKl Extracellular 
Domain 


1(1) 


411 (137) 


PERKl Transmembrane 
Domain 


412 ( 138) 


480 (160) 


PERKl Cytoplasmic 
Domain 


481 (161) 


1944 (648) 


PERKl Kinase region 


Same as 
cytoplasmic 
domain 


Same as 
cytoplasmic 
domain 



It will be apparent that these may be varied, for example, by shortening the 5* 
1 5 untranslated region or shortening the nucleic acid molecule so that the 3' end nucleotide is in 
a different position. 
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The discussion of the nucleic acid molecules, sequence identity, hybridization and 
other aspects of nucleic acid molecules included within the scope of the invention is intended 
to be applicable to either the entire nucleic acid molecule in figure 1 or the coding region of 
this molecule, shown in Table 2. One may use the entire molecule in figure 1 or only the 
5 coding region. Other possible modifications to the sequence are apparent. 

Southern Blot Analysis shows that PERKl is present as a single copy gene in 
Brassica, A Northern blot showed that PERKl polypeptide was expressed in all tissues 
examined (root, stem, leaf, petal, anther and pistil ). It is highly expressed in the stem, petal 
and pistil tissues and to a lower extent in the root, leaf and anther tissues . 

10 The PERKl Nucleic Acid Molecule and Polypeptide are Conserved in Plants 

Sequence Identity 

This is the first isolation of a nucleic acid molecule encoding a PERK polypeptide 
fix)m plant species. Nucleic acid sequences having sequence identity to the PERKl sequence 
are found in other species of Brassica such as Brassica rapa, Brassica juncea, and Brassica 

1 5 oleracea as well as other plants such as Arabidopsis, potato, tomato, tobacco, cotton, carrot, 
petunia, sunflower, strawberries, spinach, lettuce, rice, soybean, com, wheat, rye, barley, 
sorgum and alfalfa. Sequences fixim Brassica napus and other plants are collectively referred 
to as 'TERK" nucleic acid sequences and polypeptides. We isolate PERK nucleic acid 
molecules fi-om plants having nucleic acid molecules that are similar to those in Brassica 

20 napus, such as beet, tomato, rice, cucumber, radish and other plants including Arabidopsis, 
potato, tobacco, cotton, carrot, petunia, simflower, strawberries, spinach, lettuce, soybean, 
com, wheat, rye, barley, sorgum and alfalfa, and using techniques described in this 
application. The invention includes methods of isolating these nucleic acid molecules and 
polypeptides as well as methods of using these nucleic acid molecules and polypeptides 

25 according to the methods described in this application, for example those used with respect to 
PERKl. 

Table 3 below shows several sequences with sequence identity and sequence 
similarity to the PERKl polypeptide. Where polypeptides are shown, a suitable 
corresponding DNA encoding the polypeptide will be apparent. These sequences code for 
30 polypeptides similar to portions of PERKl polypeptide. The sequences in Table 3 are usefiil 
to make probes to identify fiill length sequences or firagments (fi-om the listed species or other 
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Species). They are useful to screen for functionally related cDNAs and genes. They are also 
useful to screen other tissues to see if they include all or part of the shown EST or similar 
sequences. The invention also relates to nucleic acid molecules including these EST 
sequences, hi particular, the invention includes an isolated nucleic acid comprising a 
5 sequence that hybridizes under moderate or stringent conditions to SEQ ID NOS: 1 or 3 or the 
complements thereof. The invention also includes an isolated nucleic acid molecule 
comprising a sequence having at least about: >60%, >70%, >80% or >90%, more preferably 
at least about: >95%, or >99% sequence identity to a sequence in Table 3* One skilled in the 
art would be able to design a probe based on a polypeptide or peptide fi:agment. The 

1 0 invention includes nucleic acid molecules of about: 10 to 50 nucleotides, 50 to 200 

nucleotides, 200 to 500 nucleotides, 500 to 1000 nucleotides, 1000 to 1500 nucleotides, 1500 
to 1700 nucleotides, 1700 to 2000 nucleotides, 2000 to 2500 nucleotides or at least 2500 
nucleotides and which include all or part of the sequences (or corresponding nucleic acid 
molecule) in Table 3. The invention also includes a nucleic acid molecule including the 

15 sequences in Table 3 which encodes peptides and polypeptides of about: 10 to 50 amino 

acids, 50 to 200 amino acids, 200 to 500 amino acids, 500 to 750 amino acids or at least 750 
amino acids. Possible modifications to these sequences will also be apparent. The 
polypeptide and nucleic acid molecules are also useful in research experiments or in 
bioinformatics to locate other sequences. The nucleic acid molecules and polypeptides 

20 preferably provide PERK activity. 



Table 3 



Organism 


Accession No. 


Arabidopsis thaliana 


AAC 98010 (Figure 11) 


Arabidopsis thaliana 


AAD 15491 (Figure 12) 


Arabidopsis thaliana 


CAA 18823 (Figure 13) 


Arabidopsis thaliana 


CAA 18590 (Figure 14) 



The regions of importance include the extracellular domain (ECD), transmembrane domain 
25 (TMD), and the catalytic domain Amino acid positions are as follows: 

AAC98010 - ECD: 1-247; TMD: 248-267; CD: 268 -732 
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AAD15491. ECD:1- 149 ; TMD - 150-171; CD - 172 -634 

CAA18823 - ECD:1- 179 ; TMD - 180.194; CD - 195 -675 

CAA18590 - ECD:1- 279 ; TMD - 280-302; CD - 303 -732 

The invention includes the nucleic acid molecules from other plants as well as 
5 methods of obtaining the nucleic acid molecules by, for example, screening a cDNA library 
or other DNA collections with a probe of the invention (such as a probe comprising at least 
about: 10 or preferably at least 15 or 30 nucleotides of PERKl and detecting the presence of a 
PERK nucleic acid molecule. Another method involves comparing the PERKl sequences to 
other sequences, for example using bioinformatics techniques such as database searches or 
1 0 alignment strategies, and detecting the presence of a PERK nucleic acid molecule or 

polypeptide. The invention includes the nucleic acid molecule and/or polypeptide obtained 
according to the methods of the invention. The invention also includes methods of using the 
nucleic acid molecules, for example to make probes, in research experiments or to transform 
host cells or make transgenic plants. These methods are as described below. 
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The polypeptides encoded by the PERK nucleic acid molecules in other species will 
have amino acid sequence identity to the PERKl sequence. Sequence identity may be at least 
about: >20%, >25%, >28%, >30%, >35%, >40%, >50% to an amino acid sequence shown in 
figure 1 (or a partial sequence tiiereof). Some polypeptides may have a sequence identity of 
5 at least about: >60%, >70%, >80% or >90%, more preferably at least about: >95%, >99% or 
>99.5% to an amino acid sequence in figure 1 (or a partial sequence thereof). Identity is 
calculated according to methods known in the art. Sequence identity (nucleic acid and 
protein) is most preferably assessed by the algorithm of the Fasta 3 program , using the 
following default parameter settings: gap penalty (open) = -12 (protein) -16 (DNA), gap 

10 penalty (extension) = -2 (protein) -4 (DNA) , protein weight matrix = BLOSUM 62. (The 
reference for FASTA 3 is W. R. Pearson and D. J. Lipman (1988), "Improved Tools for 
Biological Sequence Analysis", PNAS 85:2444- 2448,and W. R. Pearson (1990) "Rapid and 
Sensitive Sequence Comparison with FASTP and FASTA" Methods in Enzymology 183:63- 
98). . The invention also includes modified polypeptide fi-om plants which have sequence 

15 identity at least about: >20%, >25%, >28%, >30%, >35%, >40%, >50%, >60%, >70%, 
>80% or >90% more preferably at least about >95%, >99% or >99.5%, to the PERK 
sequence in figure 1 (or a partial sequence thereof). Modified polypeptide molecules are 
discussed below. Preferably about: 1, 2, 3, 4, 5, 6 to 10, 10 to 25» 26 to 50 or 51 to 100, or 
101 to 250 nucleotides or amino acids are modified. 

20 Nucleic Acid Molecules and Polypeptides Similar to PERKl 

Those skilled in the art will recognize that the nucleic acid molecule sequences in 
figure 1 are not the only sequences which may be used to provide increased PERK activity in 
plants. The genetic code is degenerate so other nucleic acid molecules which encode a 
polypeptide identical to an amino acid sequence in figure 1 may also be used. The sequence 
25 of the other nucleic acid molecules of this invention may also be varied without changing the 
polypeptide encoded by the sequence. Consequently, the nucleic acid molecule constructs 
described below and in the accompanying examples for the preferred nucleic acid molecules, 
vectors, and transformants of the invention are merely illustrative and are not intended to 
limit the scope of the invention. 

30 The sequences of the invention can be prepared according to numerous techniques. 

The invention is not limited to any particular preparation means. For example, the nucleic 
acid molecules of the invention can be produced by cDNA cloning, genomic cloning, cDNA 
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synthesis, polymerase chain reaction (PCR), or a combination of these approaches ( Current 
Protocols in Molecular Biology (F. M. Ausbel et aL, 1989).). Sequences may be synthesized 
using well known methods and equipment, such as automated synthesizers. Nucleic acid 
molecules may be amplified by the polymerase chain reaction. Polypeptides may, for 
5 example, be synthesized or produced recombinantly. 

Sequence Identity 

The invention includes modified nucleic acid molecules with a sequence identity at 
least about: >1 7%, >20%, >30%, >40%, >50%, >60%, >70%, >80% or >90% more 
preferably at least about >95%, >99% or >99.5%, to a DNA sequence in figure 1 (or a partial 

10 sequence thereof). Preferably about 1, 2, 3, 4, 5, 6 to 10, 10 to 25, 26 to 50 or 51 to 100, or 
101 to 250 nucleotides or amino acids are modified. Identity is calculated according to 
methods known in the art. Sequence identity is most preferably assessed by the algorithm of 
the FASTA 3 program. For example, if a nucleotide sequence (called "Sequence A") has 
90% identity to a portion of the nucleotide sequence in Figure 1, then Sequence A will be 

1 5 identical to the referenced portion of the nucleotide sequence in Figure 1 , except that 
Sequence A may include up to 10 point mutations, such as substitutions with other 
nucleotides, per each 100 nucleotide of the referenced portion of the nucleotide sequence in 
Figure 1 . Nucleotide sequences fimctionally equivalent to the PERKl sequence can occur in 
a variety of forms as described below. Polypeptides having sequence identity may be 

20 similarly identified. 

The polypeptides encoded by the homologous PERK nucleic acid molecule in other 
species will have amino acid sequence identity at least about: >20%, >25%, >28%, >30%, 
>40% or >50% to an amino acid sequence shown in figure 1 (or a partial sequence thereof). 
Some plant species may have polypeptides with a sequence identity of at least about: >60%, 
25 >70%, >80% or >90%, more preferably at least about: >95%, >99% or >99.5% to all or part 
of an amino acid sequence in figure 1 (or a partial sequence thereof). Identity is calculated 
according to methods known in the art. Sequence identity is most preferably assessed by the 
FASTA 3 program. Preferably about: 1, 2, 3, 4, 5, 6 to 10, 10 to 25, 26 to 50 or 51 to 100, or 
101 to 250 nucleotides or amino acids are modified. 



24 



wo 01/14563 



PCT/CAOO/00966 



The invention includes nucleic acid molecules with mutations that cause an amino 
acid change in a portion of the polypeptide not involved in providing PERK activity or an 
amino acid change in a portion of the polypeptide involved in providing PERK activity so 
that the mutation increases or decreases the activity of the polypeptide. 

5 Hybridization 

Other functional equivalent forms of the PERK nucleic acid molecules encoding 
nucleic acids can be isolated using conventional DNA-DNA or DNA-RNA hybridization 
techniques. These nucleic acid molecules and the PERK sequences can be modified without 
significantly affecting their activity. 

1 0 The present invention also includes nucleic acid molecules that hybridize to one or 

more of the sequences in figure 1 (or a partial sequence thereof) or their complementary 
sequences, and that encode peptides or polypeptides exhibiting substantially equivalent 
activity as that of an PERK polypeptide produced by the DNA in figure 1 . Such nucleic acid 
molecules preferably hybridize to all or a portion of PERK or its complement or all or a 

1 5 portion of an EST of Table 3 under low, moderate (intermediate), or high stringency 

conditions as defined herein (see Sambrook et al. (Most recent edition) Molecular Cloning: A 
Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y.; 
Ausubel et al. (eds.), 1995, Current Protocols in Molecular Biology, (John Wiley & Sons, 
NY)). The portion of the hybridizing nucleic acids is typically at least 15 (e.g. 20, 25, 30 or 

20 50) nucleotides in length. The hybridizing portion of the hybridizing nucleic acid is at least 
80% e.g. at least 95% or at least 98% identical to the sequence or a portion or all of a nucleic 
acid encoding a PERK polypeptide, or its complement. Hybridizing nucleic acids of the type 
described herein can be used, for example, as a cloning probe, a primer (e.g. a PCR primer) or 
a diagnostic probe. Hybridization of the oligonucleotide probe to a nucleic acid sample 

25 typically is performed under stringent conditions. Nucleic acid duplex or hybrid stability is 
expressed as the melting temperature or Tm, which is the temperature at which a probe 
dissociates firom a target DNA. This melting temperature is used to define the required 
stringency conditions. If sequences are to be identified that are related and substantially 
identical to the probe, rather than identical, then it is useful to first establish the lowest 

30 temperature at which only homologous hybridization occurs with a particular concentration of 
salt (e.g. SSC or SSPE). Then, asstmiing that 1% mismatching results in a 1 degree Celsius 
decrease in the Tm, the temperature of the final wash in the hybridization reaction is reduced 

25 
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accordingly (for example, if sequences having greater than 95% identity with the probe are 
sought, the final wash temperature is decreased by 5 degrees Celsius). In practice, the change 
in Tm can be between 0.5 degrees Celsius and 1.5 degrees Celsius per 1% mismatch. Low 
stringency conditions involve hybridizing at about: 2XSSC, 0.1% SDS at SO^'C. High 
5 stringency conditions are: O.IXSSC, 0.1% SDS at 65**C. Moderate stringency is about IX 
SSC 0.1% SDS at 60 degrees Celsius. The parameters of salt concentration and temperature 
can be varied to achieve the optimal level of identity between the probe and the target nucleic 
acid. 

The present invention also includes nucleic acid molecules from any source, whether 
1 0 modified or not, that hj^ridize to genomic DNA, cDNA, or synthetic DNA molecules that 
encode the amino acid sequence of a PERK polypeptide, or genetically degenerate forms, 
under salt and temperature conditions equivalent to those described in this appUcation, and 
that code for a peptide, or polypeptide that has PERK activity. Preferably the polypeptide has 
the same or similar activity as that of a PERKl polypeptide. A nucleic acid molecule 
1 5 described above is considered to be fimctionally equivalent to a PERK nucleic acid molecule 
(and thereby having PERK activity) of the present invention if the polypeptide produced by 
the nucleic acid molecule displays the following characteristics: The defining feature of 
PERK polypeptides is the presence of a proline-rich domain, followed by a transmembrane 
domain, followed by a kinase domain. When tested, the kinase domain has serine/threonine 
20 kinase activity. 

The invention also includes nucleic acid molecules and polyp^tides having sequence 
similarity taking into account conservative amino acid substitutions. Sequence similarity 
(and preferred percentages) are discussed below. 

Modifications to Nucleic Acid Molecule or Polypeptide Sequence 
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Changes in the nucleotide sequence which result in production of a chemically 
equivalent or chemically similar amino acid sequences are included within the scope of the 
invention. Variants of the polypeptides of the invention may occur naturally, for example, by 
mutation, or may be made, for example, with polypeptide engineering techniques such as site 
5 directed mutagenesis, which are well known in the art for substitution of amino acids. For 
example, a hydrophobic residue, such as glycine can be substituted for another hydrophobic 
residue such as alanine. An alanine residue may be substituted with a more hydrophobic 
residue such as leucine, valine or isoleucine. A negatively charged amino acid such as 
aspartic acid may be substituted for glutamic acid. A positively charged amino acid such as 
1 0 lysine may be substituted for another positively charged amino acid such as arginine. 

Therefore, the invention includes polypeptides having conservative changes or 
substitutions in amino acid sequences. Conservative substitutions insert one or more amino 
acids which have similar chemical properties as the replaced amino acids. The invention 
includes sequences where conservative substitutions are made that do not destroy PERK 
1 5 activity. The preferred percentage of sequence similarity for sequences of the invention 

includes sequences having at least about: 50% similarity to PERKl. The similarity may also 
be at least about: 60% similarity, 75% similarity, 80% similarity, 90% similarity, 95% 
similarity, 97% similarity, 98% similarity, 99% similarity, or more preferably at least about 
99.5% similarity, wherein the polypeptide has PERK activity. The invention also includes 
20 nucleic acid molecules encoding polypeptides, with the polypeptides having at least about: 
50% similarity to PERKl . The similarity may also be at least about: 60% similarity, 75% 
similarity, 80% similarity, 90% similarity, 95% similarity, 97% similarity, 98% similarity, 
99% similarity, or more preferably at least about 99.5% similarity, wherein the polypeptide 
has PERK activity, to an amino acid sequence in figure 1 (or a partial sequence thereof) 
25 considering conservative amino acid changes, wherein the polypeptide has PERK activity. 
Sequmce similarity is preferably calculated number of similar amino acids in a multiple 
alignment expressed as a percentage of the shorter of the two sequences in the alignment. The 
multiple alignment is preferably constructed using the algorithm of the FASTA 3 program, 
using the following parameter settings: gap penalty (open) = -12(protein) -16 (DNA), gap 
30 penalty (extension) = -2 (protein) -4 (DNA) , protein weight matrix = BLOSUM 62. (The 
reference for FASTA 3 is W. R. Pearson and D. J. Lipman (1988), "Improved Tools for 
Biological Sequence Analysis", PNAS 85:2444- 2448,and W. R. Pearson (1990) "Rapid and 
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Sensitive Sequence Comparison with FASTP and FASTA" Methods in Enzymology 183:63- 
98). 

Polypeptides comprising one or more d-amino acids are contemplated within the 
invention. Also contemplated are polypeptides where one or more amino acids are acetylated 
5 at the N-terminus. Those of skill in the art recognize that a variety of techniques are available 
for constructing polypeptide mimetics with the same or similar desired PERK activity as the 
corresponding polypeptide compound of the invention but with more favorable activity than 
the polypeptide with respect to solubility, stability, and/or susceptibility to hydrolysis and 
proteolj^is. See, for example, Morgan and Gainor, Ann. Rep. Med. Chem., 24:243-252 

10 (1989). Examples of polypeptide mimetics are described in U.S. Patent Nos. 5,643,873. 

Other patents describing how to make and use mimetics include, for example in, 5,786,322, 
5,767,075, 5,763,571, 5,753,226, 5,683,983, 5,677,280, 5,672,584, 5,668,110, 5,654,276, 
5,643,873. Mimetics of the polypeptides of the invention may also be made according to 
other techniques known in the art. For example, by treating a polypeptide of the invention 

1 5 with an agent that chemically alters a side group by converting a hydrogen group to another 
group such as a hydroxy or amino group. Mimetics preferably include sequences that are 
either entirely made of amino acids or sequences that are hybrids including amino acids and 
modified amino acids or other organic molecules. 

The invention also includes hybrid nucleic acid molecules and polypeptides, for 
20 example where a nucleotide sequence from one species of plant is combined with a 

nucleotide sequence from another sequence of plant, mammal, bacteria or yeast to produce a 
fiision polypeptide. The invention includes a frision protein having at least two components, 
wherein a first component of the fusion protein comprises a polypeptide of the invention, 
preferably a fiill length PERK polypeptide. The second component of the fusion protein 
25 preferably comprises a tag, for example GST, an epitope tag or an enzyme. The fusion 
protein may comprise lacZ. 

The invention also includes polypeptide fragments of the polypeptides of the 
invention which may be used to confer PERK activity if the fragments retain activity. The 
invention also includes polypeptides fragments of the polypeptides of the invention which 
30 may be used as a research tool to characterize the polypeptide or its activity. Such 

polypeptides preferably consist of at least 5 amino acids. In preferred embodiments, they may 
consist of 6 to 10, 1 1 to 15, 16 to 25, 26 to 50, 51 to 75,76 to 100 or 101 to 250 amino acids 
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of the polypeptides of the invention (or longer amino acid sequences). The fragments 
preferably have PERK activity. Fragments may include sequences with one or more amino 
acids removed, for example, C-terminus amino acids in a PERK sequence. 

The invention also includes a composition comprising all or part of an isolated PERK 
5 nucleic acid molecule (preferably PERKl) of the invention and a carrier, preferably in a 

composition for plant transformation. The invention also includes a composition comprising 
an isolated PERK polypeptide (preferably PERKl) and a carrier, preferably for studying 
polypeptide activity. 

Recombinaiit Nucleic Acid Molecules 

1 0 The invention also includes recombinant nucleic acid molecules preferably a PERKl 

sequence of figure 1 comprising a nucleic acid molecule of the invention and a promoter 
sequence, operatively linked so that the promoter enhances transcription of the nucleic acid 
molecule in a host cell (the nucleic acid molecules of the invention may be used in an isolated 
native gene or a chimeric gene, for example, where a nucleic acid molecule coding region is 

1 5 connected to one or more heterologous sequences to form a gene. The promoter sequence is 
preferably a constitutive promoter sequence or an inducible promoter sequence, operatively 
linked so that the promoter enhances transcription of the DNA molecule in a host cell. The 
promoter may be of a type not naturally associated with the cell such as a super promoter, a 
35 S cauliflower mosaic vims promoter, a chemical inducible promoter, a copper-inducible 

20 promoter, a steroid-inducible promoter and a tissue specific promoter . 
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A recombinant nucleic acid molecule for conferring PERK activity may also contain 
suitable transcriptional or translational regulatory elements. Suitable regulatory elements may 
be derived from a variety of sources, and they may be readily selected by one with ordinary 
skill in the art. Examples of regulatory elements include: a transcriptional promoter and 
5 enhancer or RNA polymerase binding sequence, a ribosomal binding sequence, including a 
translation initiation signal- Additionally, depending on the vector employed, other genetic 
elements, such as selectable markers, may be incorporated into the recombinant molecule. 
Markers facilitate the selection of a transformed host cell. Such markers include genes 
associated with temperature sensitivity, dmg resistance, or enzymes associated with 
1 0 phenotypic characteristics of the host organisms. 

Nucleic acid molecule expression levels are controlled with a transcription initiation 
region that regulates transcription of the nucleic acid molecule or nucleic acid molecule 
fragment of interest in a plant, bacteriaor yeast cell. The transcription initiation region may 
be part of the constmct or the expression vector. The transcription initiation domain or 
1 5 promoter includes an RNA polymerase binding site and an mRNA initiation site. Other 
regulatory regions that may be used include an enhancer domain and a termination region. 
The regulatory elements described above may be from animal, plant, yeast, bacteria, fringus, 
vims or other soiirces, including synthetically produced elements and mutated elements. 

Methods of modifying DNA and polypeptides, preparing recombinant nucleic acid 
20 molecules and vectors, transformation of cells, expression of polypeptides are known in the 
art. For guidance, one may consult the following US patent nos. 5,840,537, 5,850,025, 
5,858,719, 5,710,018, 5,792,851, 5,851,788, 5,759,788, 5,840,530, 5,789,202, 5,871,983, 
5,821,096, 5,876,991, 5,422,108, 5,612,191, 5,804,693, 5,847,258, 5,880,328, 5,767,369, 
5,756,684, 5,750,652, 5,824,864, 5,763,21 1, 5,767,375, 5,750,848, 5,859,337, 5,563,246, 
25 5,346,815, and W097 13843. Many of these patents also provide guidance with respect to 
experimental assays, probes and antibodies, methods, transformation of host cells and 
regeneration of plants, which are described below. These patents, like all other patents, 
publications (such as articles and database pubUcations) in this application, are incorporated 
by reference in their entirety. 
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Host Cells Including a PERK Nucleic Acid Molecule 

In a preferred embodiment of the invention, a plant or yeast cell is transformed with a 
nucleic acid molecule of the invention or a fragment of a nucleic acid molecule and inserted 
in a vector. 

Another embodiment of the invention relates to a method of transforming a host cell 
with a nucleic acid molecule of the invention or a fragment of a nucleic acid molecule, 
inserted in a vector. The invention also includes a vector comprising a nucleic acid molecule 
of the invention. The nucleic acid molecules can be cloned into a variety of vectors by means 
that are well known in the art. The recombinant nucleic acid molecule may be inserted at a 
site in the vector created by restriction enzymes. A number of suitable vectors may be used, 
including cosmids, plasmids, bacteriophage, baculoviruses and viruses. Suitable vectors are 
capable of reproducing themselves and transforming a host cell. The invention also relates to 
a method of expressing polypeptides in the host cells. A nucleic acid molecule of the 
invention may be used to transform virtually any type of plant, including both monocots and 
dicots. The expression host may be any cell capable of expressing PERK, such as a cell 
selected from the group consisting of a seed (where appropriate), plant cell, bacterium, yeast, 
fungus, protozoa, algae, animal and animal cell. 

Levels of nucleic acid molecule expression may be controlled with nucleic acid 
molecules or nucleic acid molecule fragments that code for anti-sense RNA inserted in the 
vectors described above. 

Agrobacterium tumefaciens-mGdiatcd transformation, particle-bombardment-mediated 
transformation, direct uptake, microinjection, coprecipitation and electroporation-mediated 
nucleic acid molecule transfer are usefiil to transfer a PERK nucleic acid molecule into seeds 
(where appropriate) or host cells, preferably plant cells, depending upon the plant species. 
The invention also includes a method for constructing a host cell capable of expressing a 
nucleic acid molecule of the invention, the method comprising introducing into said host cell 
a vector of the invention. The genome of the host cell may or may not also include a 
functional PERK gene. The invention also includes a method for expressing a PERK 
polypeptide such as a PERKl in figure 1 in the host cell or a plant, plant part, seed or plant 
cell of the invention, the method comprising culturing the host cell under conditions suitable 
for gene expression. The method preferably also includes recovering the expressed 
polypeptide from the culture. 
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The invention includes the host cell comprising the recombinant nucleic acid 
molecule and vector as well as progeny of the cell. Preferred host cells are fungal cells, yeast 
cells, bacterial cells, mammalian cells, bird cells, reptile cells, amphibious cells, 
microorganism cells and plant cells. Host cells may be cultured in conventional nutrient 
5 media. The media may be modified as appropriate for inducing promoters, amplifying genes 
or selecting transformants. The culture conditions, such as temperature, composition and pH 
will be apparent. After transformation, transformants may be identified on the basis of a 
selectable phenotype. A selectable phenotype can be conferred by a selectable marker in the 
vector. 

10 Transgenic Plants and Seeds 

Plant cells are useful to produce tissue cultures, seeds or whole plants. The invention 
includes a plant, plant part, seed, or progeny thereof including a host cell transformed with a 
PERK nucleic acid molecule such as a molecule in figure 1. The plant part is preferably a 
leaf, a stem, a flower, a root, a seed or a tuber. 

1 5 The invention includes a transformed (transgenic) plant having increased PERK 

activity, the transformed plant containing a nucleic acid molecule sequence encoding for 
polypeptide activity and the nucleic acid molecule sequence having been introduced into the 
plant by transformation under conditions whereby the transformed plant expresses a PERK 
polypeptide in an active form. 

20 The methods and reagents for producing mature plants fi^om cells are knovm in the art. 

The invention includes a method of producing a genetically transformed plant which 
expresses PERK polypeptide such as a polypeptide in figure 1 by regenerating a genetically 
transformed plant fi-om the plant cell, seed or plant part of the invention. The invention also 
includes the transgenic plant produced according to the method. Alternatively, a plant may be 

25 transformed with a vector of the invention. 

The invention also includes a method of preparing a plant with increased PERK 
activity, the method comprising transforming the plant with a nucleic acid molecule which 
encodes a polypeptide of figure 1 or a polypeptide encoding a PERK polypeptide capable of 
increasing PERK activity in a cell, and recovering the transformed plant with increased PERK 
30 activity. The invention also includes a method of preparing a plant with increased PERK 

activity, the method comprising transforming a plant cell with a nucleic acid molecule such as 
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a molecule of figure 1 which encodes a PERK polypeptide capable of increasing PERK 
activity in a cell. 

Overexpression of PERK leads to an improved ability of the transgenic plants to resist 
wounding or pathogen damage. 

5 The plants whose cells may be transformed with a nucleic acid molecule of this 

invention and used to produce transgenic plants include, but are not limited to the 

Target plants : Brassica napus, Brassica rapa, Brassica juncea, Brassica oleracea^ or from 
the family Brassicaecae, Arabidopsis, potato, tomato, tobacco, cotton, carrot, petunia, 
simflower, strawberries, spinach, lettuce, rice, soybean, com, wheat, rye, barley, sorgum and 
1 0 alfalfa. Cereal plants including rye, barley and wheat may also be transformed with a PERK 
polypeptide, preferably PERKl. 

In a preferred embodiment of the invention, plant tissue cells or cultures which 
demonstrate PERK activity (or increased PERK activity compared to wild type) are selected 
and plants are regenerated from these cultures. Methods of regeneration will be apparent to 
1 5 those skilled in the art (see Examples below, also). These plants may be reproduced, for 

example by cross polUnation with a plant that does not have PERK activity. If the plants are 
self-pollinated, homozygous progeny may be identified from the seeds of these plants, for 
example, using genetic markers. Seeds obtained from the mature plants resulting from these 
crossings may be planted, grown to sexual maturity and cross-pollinated or self-pollinated. 

20 The nucleic acid molecule is also incorporated in some plant species by breeding 

methods such as back crossing to create plants homozygous for the PERK nucleic acid 
molecule. 

A plant line homozygous for the PERK nucleic acid molecule may be used as eithCT a 
male or female parent in a cross with a plant line lacking the PERK nucleic acid molecule to 
25 produce a hybrid plant line which is uniformly heterozygous for the nucleic acid molecule. 
Crosses between plant lines homozygous for the PERK nucleic acid molecule are used to 
generate hybrid seed homozygous for the resistance nucleic acid molecule. 

Fragments/Probes 

Preferable fragments include 10 to 50, 50 to 100, 100 to 250, 250 to 500, 500 to 1000, 
30 1000 to 1500, or 1500 or more nucleotides of a nucleic acid molecule of the invention. A 
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fragment may be generated by removing a single nucleotide from a sequence in figure 1 (or a 
partial sequence thereof). Fragments may or may not encode a polypeptide having PERK 
activity. 

The nucleic acid molecules of the invention (including a fragment of a sequence in 
5 figure 1 (or a partial sequence thereof) can be used as probes to detect nucleic acid molecules 
according to techniques known in the art (for example, see US patent nos. 5,792,851 and 
5,851,788). The probes may be used to detect nucleic acid molecules that encode 
polypeptides similar to the polypeptides of the invention. For example, a probe having at 
least about 10 bases will hybridize to similar sequences under stringent hybridization 
10 conditions (Sambrook et al. 1989, Molecular Cloning, A Laboratory Manual, Cold Spring 

Harbor). Polypeptide fragments of PERKl are preferably at least 8 amino acids in length and 
are useful, for example, as immunogens for raising antibodies that will bind to intact protein 
(immunogenic fragments). Typically the average length used for synthetic peptides is 8-16, 8 
being the minimiun, however 12 amino acids is commonly used. 

15 Kits 

The invention also includes a kit for conferring increased PERK activity to a plant or a 
host cell including a nucleic acid molecule of the invention (preferably in a composition of 
the invention) and preferably reagents for transforming the plant or host cell. 

The invention also includes a kit for detecting the presence of PERK nucleic acid 
20 molecule (e.g. a molecule in figure 1), comprising at least one probe of the invention. Kits 
may be prepared according to known techniques, for example, see patent nos. 5,851,788 and 
5,750,653. 

Antibodies 

The invention includes an isolated antibody immunoreactive with a polypeptide of the 
25 invention. Antibodies are preferably generated against epitopes of native PERKl or synthetic 
peptides of PERKl . The antibody may be labeled with a detectable marker or unlabeled. The 
antibody is preferably a monoclonal antibody or a polyclonal antibody. PERK antibodies can 
be employed to screen organisms containing PERK polypeptides. The antibodies are also 
valuable for immuno-purification of polypeptides from cmde extracts. 

30 Examples of the preparation and use of antibodies are provided in US Patent Nos. 

5,792,85 1 and 5,759,788. For other examples of methods of the preparation and uses of 
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monoclonal antibodies, see US Patent Nos. 5,688,681, 5,688,657, 5,683,693, 5,667,781, 
5,665,356, 5,591,628, 5,510,241, 5,503,987, 5,501,988, 5,500,345 and 5,496,705. Examples 
of the preparation and uses of polyclonal antibodies are disclosed in US Patent Nos. 
5,512,282, 4,828,985, 5,225,331 and 5,124,147. 

5 The invention also includes methods of using the antibodies. For example, the 

invention includes a method for detecting the presence of a PERK polypeptide such as 
PERKl, by: a) contacting a sample containing one or more polypeptides with an antibody of 
the invention under conditions suitable for the binding of the antibody to polypeptides with 
which it is specifically reactive; b) separating imbound polypeptides from the antibody; and c) 
10 detecting antibody which remains bound to one or more of the polypeptides in the sample. 

Research Tool 

Cell cultures, seeds, plants and plant parts transfomied with a nucleic acid molecule of 
the invention are useful as research tools. For example, one may obtain a plant cell (or a cell 
line,) that does not express PERKl, insert a PERKl nucleic acid molecule in the cell, and 
15 assess the level of PERKl expression and activity. 

The PERK nucleic acid molecules and polypeptides including those in figure 1 are 
also useful in assa>^. Assays are useful for identification and development of compoimds to 
inhibit and/or enhance polypeptide function directly. 

Suitable assays may be adapted from, for example, US patent no. 5,851,788 

20 Using Exogenous Agents in Combination with a Vector 

The nucleic acid molecules of the invention may be used with other nucleic acid 
molecules that relate to plant protection, for example, extensin nucleic acid molecules. Host 
cells or plants may be transformed with these nucleic acid molecules. 

PERKl ACTIVITY 

25 We show that PERKl encodes a protein with kinase activity as its sequence predicts. 

The bacterially expressed catalytic domain fusion protein of PERKl is tested for kinase 
activity (Figure 9c). Furthermore, to ensure that the phosphorylation of the fusion protein 
was not a result of bacterial kinase activity, a mutated catalytic domain was also generated 
by site directed mutagenesis which introduced a single base pair substitution of a lysine 

30 residue to a glutamic acid residue (K->E). This mutation modifies the essential invariant 
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lysine of subdomain n required for phospho-transfer and renders the kinase inactive. Both 
the wild-type and the mutated catalytic domains of PERKl were cloned into the pMAL-c 
expression system, induced for protein production in the presence of IPTG and purified by 
affinity chromatography on MBP amylose resin (Figure 9a)- Figure 9b is a Western blot to 

5 confirai the induction and purification of both the wild-type and mutated catalytic domain 
fiision proteins using an anti-MBP antibody. The wild-type fiision protein appears to be toxic 
in bacteria which compromises its inducibility and purification (Figure9a; lane 2). The 
mutated fiision protein is induced and purified more efficiently, perhaps due to the fact that it 
is no longer kinase active (Figure9a; lane 1). 

1 0 Figure 9c represents a kinase assay performed on affinity purified wild-type and 

mutated fiision proteins incubated in the presence of y-32PdATP, Detection of a 
phosphoprotein only in lane 1 provides direct biochemical evidence that the wild-type 
catalytic domain of PERKl encodes a fimctional protein kinase that is capable of 
autophosphorylation (Lane 1) and that the mutation successfiiUy aboUshed kinase activity 

15 (Lane 2). 

Since PERKl is known to encode a protein with kinase activity, phosphoamino acid 
analysis was performed to determine the amino acid specificity of its autophosphorylating 
activity. The results in figure 9d demonstrate that PERKl is phosphorylated on 
serine/threonine residues and is therefore a serine/threonine receptor kinase. 

20 Transgenic Plants 

An important focus of agricultural biotechnology research is in devising new 
strategies to combat crop losses due to plant diseases and pests. Advances in the 
development and improvement of plant transfomiation techniques has opened up new 
avenues for generating crops with enhanced resistance against disease and insect attack (i.e. 

25 mechanical woimding). Plants can now be engineered to express pathogen derived 

compounds that disrupt the infection process or alternatively a more desirable and perhaps 
effective approach is to enhance a plant's endogenous defense mechanisms. Furthermore, the 
downstream defense mechanisms are fairly well understood, therefore manipulation of some 
potential upstream signals that control the battery of defense proteins may be a promising 

30 strategy to engineer plants with enhanced and broad-spectrum resistance against both insect 
injury and pathogen attack. PERKl fimctions as one such gene. 
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One approach to show the role of a particular gene in the regulation of gene 
expression in response to pathogen attack or mechanical wounding has been to generate 
transgenic plants constitutively expressing the respective cDNA in the sense [1-2] or 
antisense [3] orientations and examining the effects of wounding and pathogen attack on 
5 known downstream target genes involved in these processes. Generating plants (including 
plant parts and seeds) that overexpress the antisense PERKl transcript shows the role of 
PERKl in mediating a plant's defense response to both wounding and pathogen attack. By 
mechanically injuring, inoculating with pathogens (i.e. Sclerotinia sclerotiorum, 
Cylindrosporium concertricum, Phoma lingani) or treating with chemical elicitors and 

1 0 looking at the levels of downstream genes directly involved in these various processes, we 
directly implicate PERKl in these pathways. For example, the expression of the functional 
PERKl protein is abolished and induction of downstream genes is reduced, so PERKl is an 
important upstream component of the pathway. PERKl offers protection against wounding 
and pathogen attack, so plants overexpressing PERKl in the sense orientation exhibit an 

1 5 accumulation of downstream target transcripts involved in these responses and ultimately 
enJianced resistance. 

We score for a phenotype such as enhanced survival of plants overexpressing PERKl 
compared to the enhanced susceptibility of plants overexpressing antisense PERKl 
transcripts in response to pathogen treatment. 

20 (References 1 . Tang, X. et al. (1999). Overexpression of Pto activates defense responses and 
confers broad resistance. Plant Cell 11, 15-29; 2. Cao, H. et al. (1998). Generation of broad- 
spectrum disease resistance by overexpression of an essential regulatory gene in systemic 
acquired resistance. Proc. Natl. Acad. Sci. 95, 6531-6536; 3. Royo, J. et al. (1999). Antisense- 
mediated depletion of a potato lipoxygenase reduces wound induction of proteinase inhibitors 

25 and increases weight gain of insect pests. Proc. Natl. Acad. Sci. 96, 1 146-1 1 5 1 .) 

Transgenic PERKl Brassica napus Plants 

In order to generate transgenic Brassica napus plants expressing altered levels of 
PERKl, the full length PERKl cDNA is cloned into a plant transformation vector in the 
sense orientation downstream of the constitutively active 35S promoter from the cauliflower 
30 mosaic vims (35S CaMV). Subsequently, this expression construct is introduced into an 
Agrobacterium tumefaciens strain in order to transform B. napus plants v\2iAgrobacterium 
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mediated transfonnation. Plant transformants containing the integrated PERKl cDNA are 
selected and tested for the presence of the transgene. 

Upon generating transgenic plants constitutively expressing the PERKl cDNA in the 
sense orientation, we examine the effects of wounding and pathogen attack on known 
5 downstream genes involved in these processes to confirm the involvement of PERKl in 
these pathways. PERKl is an upstream component of these pathways. It is involved in 
mediating a plant's defense response to both wounding and pathogen attack , and transgenic 
plants overexpressing the PERKl cDNA in the sense orientation exhibit an increase in the 
expression of downstream target genes of these pathways. Furthermore, these plants exhibit 
10 an enhanced survival relative to a wild type plant in response to wounding and pathogen 
attack. 

PERKl is a signaling molecule in response to wounding and pathogen attack by 
generating transgenic plants expressing an HA-epitope tagged PERKl protein at either the C 
or N terminus of the protein. Induction of PERKl with respect to changes in levels of 
15 phosphorylation shows the role of PERKl in these processes. The PERKl protein is 

immunoprecipitated using the anti-HA antibody from total protein extracts prepared from B, 
napus tissue after treatment with a specific stimulus. This approach provides quantitative 
results for the levels of PERKl phosphorylation in response to various stimuli. 

EXAMPLES 

20 Isolation and Sequence Analyses of PEKKl cDNA 

In order to isolate novel receptor-like protein kinases in B. napus a combination of 
degenerate oligonucleotide primers designed against conserved kinase subdomains I and Vn 
(Hanks and Quinn, 1991) were used to amplify mass excised phagemid DNA from a newly 
constructed lambda-pistil cDNA library. The cDNAs encoding products of the expected 

25 length (-420-450 bp) were cloned and the deduced partial sequences were analysed against 
several databases in order to determine which clone represented a novel kinase. One of 
several candidates, showed the highest degree of sequence similarity to the cytoplasmic 
domain of known plant receptor protein kinases, and was therefore used to screen the 
amplified lambda-pistil cDNA library. Several positive clones obtained from the Ubrary 

30 screen were completely sequenced and a partial 1 5 12 bp consensus sequence was generated to 
represent the PERKl cDNA isolated from the library screen. Although this partial PERKl 
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cDNA had an open reading frame, it did not encode a full length transcript, therefore the 5' 
end was completed by 5* RACE (see Methods). 

The deduced amino acid sequence of PERKl is shown in Figure 1 A and a schematic 
representation of its hydropathy plot is shown in Figure IB. The full length cDNA sequence 
is 2189 bp and consists of one large open reading frame of 1944 bp encoding a predicted 
protein of 648 amino acids with an estimated molecular mass of 69 kDa (Figure 1). The first 
methionine of this open reading frame is preceded by two in frame stop codons, TAA and 
TGA at positions -48 to -45 and -23 to -21 respectively, hi addition, there is also an AGAA 
sequence at position -9 to -6 (Figure 1) which is a favourable site for translation initiation in 
all eukaryotes (Lutcke et al., 1987). 

PERKl encodes a receptor-like kinase possessing an extracellular domain, a single 
membrane spanning domain and an intracellular kinase domain (Figure IB) with four 
potential N-linked glycosylation sites (Asn-X-Ser/Thr) found throughout the sequence (Figure 
1 A) (Weinstein et al., 1982). The predicted polypeptide sequence was analyzed using the 
PSORT database and determined to be a Type lb intergral membrane protein with a 
hydrophilic amino terminal domain exposed on the exterior of the membrane but whose 
coding sequence does not indicate a cleavable signal sequence preceding this domain. Singer 
(1990) proposes that despite the lack of a signal peptide. Type lb integral membrane proteins 
are inserted into the membrane via the usual ER-translocator protein machinery with some 
slight modifications. The extracellular domain of this protein consists of 137 amino acids 
(Figure 1 A) rich in proline and sharing sequence similarity with extensins, a family of 
hydroxyproline-rich glycoproteins (HRGPs) that constitute a major protein component of 
higher plant cell walls (Showalter, 1993). Extensin proteins have two proposed functions in 
plants, one which contributes to the structural support of the cell wall by forming 
glycoprotein networks and the other which involves plant defense; helping to protect the plant 
against mechanical wounding or patiiogen attack (Wilson and Fry, 1986; Showlater, 1993). A 
distinctive characteristic prevalent among dicot extensins is the repetitive Ser-(Pro)4 
pentapeptide consensus motif (Showalter, 1993). A unique feature of PERKl's extracellular 
domain is the predominance of a slightly modified Ser-(Pro)2.3 motif in addition to the 
presence of one signature pentapeptide block (Figure 1 A). In order to investigate the 
phylogenetic status of PERKl, sixty four deduced amino sequences corresponding to the 
extracellular and transmembrane domains of extensin, proline rich and other cell wall 
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proteins were retrieved from Genbank and used to construct a phylogenetic tree (Clustal X). 
Results from the phylogenetic analysis indicated that PERKl is most similar to a subset of 
extensin proteins as shown by the sequence homology restricted predominantly to the 
serine/proline rich regions of the protein (data not shown). Extensins and proline rich proteins 
5 comprise two major classes of cell wall proteins and are essential for maintaining the proper 
architecture of a plant cell wall as well as important in helping protect plant cells against 
wounding and pathogen invasion. These protein families have been the focus of many 
research efforts and members belonging to these protein classes have been isolated in a wide 
range of plant systems. A certain degree of sequence identity or conservation is retained 
10 among these proteins isolated from different plants. Given that PERKl shows homology in 
its extracellular domain to the extensin family of cell wall proteins and is rich in proline 
residues, homologues of PERKl exist in many other plant systems. 

The protein also contains two other domains of note. Hydropathy analysis (Kyte and 
Doolittle, 1982) of the protein sequence predicted a membrane spanning region of 23 amino 
1 5 acids (Figure 1 ; residues 138-160) followed by a characteristic stop transfer sequence rich in 
charged amino acids [Arg-Arg-Arg] required for the proper insertion in the membrane 
(Weinstein et aL, 1982). 

All known protein kinases display amino acid sequence similarity in their catalytic 
domains which are comprised of eleven subdomains containing some invariant residues 

20 important for catalysis (Hanks and Quinn, 1991). The overall features of this organization are 
identified in the catalytic domain of the PERKl protein in that all of the absolutely 
conserved amino acids as well as the highly conserved amino acid groups are present (Figure 
1 A). The sequences of DIKASN in subdomain VI and GTFGYLAPE in subdomain Vm 
(Hanks and Quinn, 1991) are strong indicators that PERKl may possess serine/threonine 

25 rather than tyrosine substrate specificity (Figure 1 A). 

PERKl is a Single Copy Gene and Ubiquitously Expressed in JS. napus Tissue 

As shown in Figure 2, Southern blot analysis was performed under conditions of varying 
stringency using B. napus genomic DNA digested with several restriction enzymes in order to 
determine copy number of PERKl in the Brassica genome. Based on known restriction sites 
30 within the cDNA and identical hybridization pattems obtained for low and high stringency 
conditions (Figure 2), PERKl exists as a single copy gene and is not a member of a 
multigene family. 
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In order to determine whether PERKl is expressed in plant tissues, RNA gel blot 
analysis was performed using total RNA isolated from a variety ofB, napus tissues as shown 
in Figure 3. The full length PERKl cDNA probe used in this experiment detected a transcript 
of 2.2 kb (Figure 3; upper panel) which is consistent with the size of the full length PERKl 
5 cDNA. The 2.2 kb PERKl transcript was most abundant in B, napus, stem petal and pistil 
tissue (Figure 3; upper panel). Levels of PERKl mRNA were also detected in root, leaf and 
anther tissues albeit at much lower levels (Figure 3; upper panel). As an internal control the 
blot was reprobed with a 18S rRNA to ensure even loading of the total RNA. A transcript 
detected with relatively the same intensity in all tissues indicates that equal amounts of total 
1 0 RNA was used (Figure 3; lower panel). \ The difference in the intensity of the 1 8S rRNA 
signal in anther tissue (Figure 3; lower panel) is a common problem associated with the 
desiccate nature of this tissue. 

Changes in PERKl mRNA in Response to Wounding and Chemical Elicitors 

In order to examine whether PERKl expression could be influenced by external 
15 stimuli, leaf and stem tissue of A napus plants were wounded and the abundance of PERKl 
mRNA was determined by standard Northem blot analysis using the full length PERKl 
cDNA as a probe (see Methods). Figure 4 shows changes in the steady-state levels of 
PERKl mRNA accumulation following injury. PERKl transcripts in wounded leaf tissue 
began to accumulate 5 min after woimding, reaching maximal levels within 15 rain post 
20 injury represented by an 12fold induction, A 4.5 fold increase in PERKl mRNA levels was 
detected 45 min following treatment declining towards basal levels by 2 hr (Figure 4). 

A similar projBle of PERKl mRNA steady state levels was obtained for wounded stem 
tissue (Figure 4). An accumulation of PERKl mRNA in stem is evident 5 min following 
wounding which represents a 3.6 fold induction of this gene. Maximum steady state levels of 

25 PERKl mRNA in stem was achieved 30 min after injury corresponding to a 7 fold 

induction. PERKl mRNA levels also accmnulate in wounded leaf disc tissue as well as 
following an abrasive wounding treatment (Figure 4a, 4b). Levels of PERKl mRNA also 
increase rapidly in the roots of hydroponically grown Brassica napus plants (Figure 4c). 
Therefore, the overall kinetics of PERKl mRNA accumulation in leaf, stem and root tissue 

30 after mechanical wounding is clearly a very rapid response (Figure 4). 

Defense mechanisms deployed by plants in response to wounding or pathogen attack 
have been shown to be induced by certain plant derived chemicals such as methyl jasmonate 
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(MeJA) and salicylic acid (SA). In order to examine changes in the levels of PERKl mRNA 
abundance in response to exogenous apphcation of MeJA, B, napus plants were thoroughly 
sprayed with a 5 micromolar MeJA solution. Leaf and stem tissue was subsequently harvested 
at various times and the steady state levels of PERKl mRNA were analysed. Figure 5A 
5 shows the RNA gel blot and corrected PERKl mRNA profile for treated leaf tissue during 
which no significant accumulation of PERKl mRNA was detected. In response to MeJA, 
levels of PERKl transcript in leaf tissue were very weak resembling basal levels in imtreated 
tissue (Figure 3; upper panel). Exogenous application of MeJA to stem tissue had no effect 
on tiie accumulation of PERKl mRNA as shown by the corrected profile in which the fold 
10 induction of PERKl did not deviate substantially from the untreated control (0 hr) (Figure 

5B). Furthermore, no increase in the steady state levels of PERKl mRNA was detected in the 
appropriate control treatment (0.1% [7v] ethanol, solvent control for MeJA) at time 0 hr. 

Many genes isolated to date that are induced by a pathogenic stimulus can be at least 
partially induced by SA (Ward et al., 1991). In order to address the potential role of PERKl 

15 in a plant's defense response against pathogen attack, 4mM S A was used as a chemical 
elicitor and sprayed onto B. napus plants. Figure 6A shows that when SA is exogenously 
applied to leaf tissue, PERKl mRNA accumulates 15 min following treatment reaching a 
maximum 5 fold induction 45 min post-treatment. Steady state levels of PERKl mRNA in 
treated stem tissue peaked at 45 min corresponding to an approximate 2 fold induction in 

20 response to 4mM SA (Figure 6B). 

In order to address whether levels of PERKl mRNA accumulate in response to a 
pathogen attack, Brassica napus leaf tissue was inoculated with the fungal pathogen 
Sclerotinia sclerotiorum (Figure 7). A 2.5 fold induction in the levels of PERKl mRNA was 
detected lOhrs following inoculation with the fimgus. This evidence suggests that PERKl 
25 may play a role in mediating a plant's defense response against pathogen attack. 

MATERIALS AND METHODS 
Construction of Lambda-Pistil cDNA Library 

Pistils were collected from floral buds of Westar and Wl cultivars 1-2 days before anthesis. 
30 Total RNA was isolated using the method described by Jones et al. (1985), and enriched for 
poly(A)^ mRNA by affinity chromatography using pre-packed oligo (dT)25-cellulose beads 
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(New England Biolabs, Beverly, MA). Approximately five micrograms of pistil poly(A)^ 
mRNA was used for the construction of a cDNA library using the ZAP-cDNA® synthesis kit 
(Stratagene, La Jolia, CA). The information encoded by the poly(A)'*^ mRNA was reversed 
transcribed using M-MuLV RT and converted into stable, unidirectional cDNA which was 
5 subsequently inserted into a self-replicating Uni-ZAP XR vector, packaged into phage 
particles in three separate packaging reactions and amplified as described by the 
manufacturer's procedures (Stratagene, La JoUa, CA). Infection of Escherichia coli host 
strain XLI-Blue yielded a primary library with an average titer of LOx 10^ plaque forming 
units. The primary library was subsequently amplified to obtain an av^ge total of 6.6x10*° 
1 0 plaque forming units. 

Generation of Novel Receptor-like Protein Kinase Clones 

The isolation of novel Brassica napus receptor kinases relied upon the newly 
constractedcDNA library and involved in vivo mass excision of the pBluecsript phagemids 
jfrom the Uni-ZAP XR vectors as outlined by the manufacturer (Stratagene, La JoUa, CA), 

15 Following efficient mass excision, phagemid DNA was extracted using a large scale alkaUne 
protocol as described by Sambrook et al. (1989) and subjected to the pol>inerase chain 
reaction (PGR) using two separate oligonucleotide combinations, RK1/RK2 and RK1/RK3 
(obtained firom M. Cock, Ecole Normale Superieure de Lyon, France) specifically designed to 
prime conserved subdomains of the catalytic domain of receptor protein kinases. RKl (5- 

20 ggiggTTTCggiAT^CAgTiTTVcAA^ggg - 3') served as the forward primer and was 
constructed based upon a conserved amino acid consensus (GGFGIVVyKG) within 
subdomain I of the catalytic domain. The degeneracy of one reverse primer RK2 (5* - 
AAiATiC\igCCATiCC%AA%^c - 3') reflects a conserved amino acid consensu 
(DFGMARIF) of subdomain VU which closely resembles the SRKs in Brassica. The second 

25 reverse oligonucleotide RK3 (5' - A^AiA^A^cTTigCiA^iCC^AA'^gTC - 3') was generated 

based upon conserved amino acids (DFGLAKLL) within subdomain VII prevalent among the 
RLKs isolated in Arabidopsis. Phagemid DNA was amplified in a reaction mixture 
containing 1 microliter of excised phagemid DNA, lOx PCR buffer (lOOmM Tris-HCl pH83, 
500mM KCl, 15mM MgCb), lOmM deoxyribonucleotide triphosphate mixture, 10 

30 micromolar of each oligonucleotide primer and 0.5 microliter Tsg polymerase (BioBasics, 
Canada). The PCR reaction was heated at 95**C for 2 min and amplified for 35 cycles imder 
the following amplification conditions: 1 min at 95'*C for denaturation, 1 min 30 sec at 50**C 
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for primer annealing and 1 min at 72^C for synthesis. A final extension cycle of 10 min at 
72"C was also incorporated into the ampUfication program. All PCR products generated of 
the expected size (420-450 bp) were gel purified, cloned into the pT7Blue plasmid (Novagen, 
Madison, WI) and introduced into Escherichia coli DH5- alpha. Transfonnants were tested 
5 for the presence of an insert and positive clones were sequenced with universal primCTS (R-20 
and U-19) by an ABI automated sequencer (Model 373 STRETCH DNA; Perkin Elmer 
Corp., Canada Ltd.) using the dideoxychain-terminating method described by Sanger et al. 
(1977). Sequence analyses performed using DNAsis® software (Hitachi Software, San 
Bruno, CA) at the nucleotide and amino acid levels. 

1 0 Screening of Lambda-Pistil cDNA Library 

The original 351 bp PCR product was used to screen the lambda-pistil cDNA library. 
Approximately 2x10^ plaques from the amplified library were screened and plated at a density 
of 1x10^ pfu/plate. Duplicate colony lifts were performed according to Sambrook et al. 
(1989), and prehybridized for 2 hr at 42''C in 50% formamide, 5x Denhardt's solution (Ix 

15 Denhardt's solution is 0.02% Ficoll, 0.02% DVP, 0.02% BSA), 5x SSC (Ix SSC is 0.15M 
NaCl, 0.0 15M sodium citrate), 0.1% SDS, ImM EDTA and 100:g/ml salmon sperm DNA. 
Filters were subsequently hybridized overnight in the same solution containing the 351 bp 
PERKl cDNA radiolabeled by random priming (Feinberg and Vogelstein, 1983) and washed 
twice with 2x SSC, 0.1% SDS at room temperature for 15 min, followed by two 25 min 

20 washes with 0.5x SSC, 0.1% SDS at 55"C. Plaques containing putative positive clones were 
cored and subjected to several roimds of screening until single isolates representing the 
PERKl clone were obtained. Single clone excision to liberate the double stranded 
pBluescript phagemid was performed on each isolate according to the procedure 
recommended by the manufacturer (Stratagene, La JoUa, CA). Phagemid DNA digested with 

25 EcoRI/XhoI to release the cloned cDNA was subjected to standard plasmid Southern blot 
anal>^is as described by Sambrook et al. (1989) and probed with the radiolabeled 351 bp 
PERKl cDNA. The membrane was prehybridized at 42**C in 5x SSPE, lOx Denhardt's 
solution and 0.5% SDS for 2 hr and hybridized overnight at the same temperature in a buffer 
containing 50% fomiamide, 5x SSPE and 0.5% SDS. Washing conditions were performed 

30 twice at room temperature for 15 min in 2x SSC, 0.1% SDS followed by several 30 min 

washes at 55-60°C in O.lx SSC, 0.1% SDS. An intense hybridization signal would confirm 
whether phagemids isolated from the library screen contained the cloned cDNA of interest. 
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Several positive clones were sequenced as previously mentioned using both universal and 
sequence specific primers to generate a consensus sequence representing the PERKl cDNA 
clone (1512bp) isolated from the lambda-pistil cDNA library. 

Rapid Amplification of cDNA Ends (5'RACE) 

5 The 5' end of the PERKl cDNA was obtained by the procedure for the rapid 

amplification of cDNA ends originally described by Frohman et al. (1988) using the 5' RACE 
System, Version 2.0 kit (Gibco-BRL, Gainthersburg, MD). First strand cDNA was 
synthesized from approximately 300:g of mixed Westar and Wl pistil total RNA using a gene 
specific primer GSPl (5'-TAACCAACAAgAgACA-3') designed to anneal approximately 

10 300 bp fix>m the 5' end of the PERKl cDNA (1512 bp) isolated firom the library screen. 

Following cDNA synthesis, the first strand product was purified from unincorporated dNTPs 
and GSPl using a GLASS MAX® spin cartridge. A homopolymeric tail was added to the 3' 
end of the cDNA using TdT (terminal deoxynucleotidyl transferase) and dCTP. Tailed cDNA 
was amplified using a second gene specific primer GSP2 (5*-CCACTCCCAACTTTCAAC - 

15 3') designed to anneal 3' to GSPl with respect to the cDNA, and an abridged anchor primer 
(Gibco-BRL, Gainthersburg, MD) which annealed to the homopolymeric tail. PGR 
amplification was carried out for 35 cycles of denaturation at 94'*C for 1 min, primer 
annealing at 55"C for 1 min and extension at 72**C for 2 min, followed by a final extension 
cycle for 10 min. A PGR product of the expected size (~1 kb) corresponding to the 5'end of 

20 PERKl was gel purified, cloned into the pTTBlue plasmid (Novagen, Madison, WI) and 
transformed into Escherichia coli DH5- alpha. Confirmation of the 5 'RACE product was 
obtained by plasmid Southern blot analysis as described above and by sequential primer based 
sequencing. 

Cloning of Full Length PERKl cDNA 

25 A PGR based approach was used to generate a fiill length PERKl cDNA by combining the 
5*RACE product cloned into the EcoRV site of pTTBlue with the cDNA isolated from the 
library screen cloned into the EcoRI/XhoI sites of the pBluescript SK phagemid. A forward 
primer (5'-ggAAAgCTTgCATgCCTgCAggTCgAC -3*) contaming an internal Pstt site was 
designed to anneal upstream to the EcoRV cloning site of pTTBlue. A reverse primer (5*- 

30 CgCCTgCAggTAATACgACTCACTATAggg -3 ') also containing a Psfl site was designed 

based on pBluecsript phagemid sequence immediately 3' to the EcoRI/XhoI cloning site. Full 
length PERKl cDNA was generated from a ICQ microliter PCR reaction containing 1 
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microliter ('-20ng) of each template (cDNA in pT7Blue and pBluescript phagemid), lOx Pfii 
Buffer (200mM Tris-HCl pH8.8, lOOmM (NH4)2S04, 20mM MgS04, 1% Triton®X-100, 
Img/mlBSA), lOmM dNTPs, SOpmol forward and reverse primers and 1 microliter Pfu 
polymerase (Gibco-BRL, Gainthersburg, MD). The samples were heated to 94**C for 5 min 
5 and amplified for 30 cycles with a denaturing cycle of 1 min, a pimer annealing cycle at 53**C 
for 1 min followed by an extension cycle for 3 min at 72**C. The resulting PCR product of the 
expected size (~2.2kb) was gel purified and cloned into the PstI restriction site of pBluescript 
KS (^/.) n. The fiill length PERKl cDNA sequence was confirmed by a sequential primer 
based sequencing approach using both universal and sequence specific primers as previously 
1 0 described. All DNA and protein sequence analysis was performed using the DNAsis® 
Software (Hitachi Software, San Bruno,CA). 

Genomic DNA Isolation and Southern Blot Analysis 

Genomic DNA was extracted fi:-om approximately one gram of young Brassica napus 
leaf tissue according to the method described by Goring et al. (1992b). Approximately 5:g of 

15 genomic DNA was digested with several restriction enzymes (BamHI, EcoRI, Hindlll, PstI, 
Xbal, Xhol), firactionated through a 0.8% agarose gel and transferred overnight in lOx SSC 
onto Zetaprobe membrane (Biorad, Hercules,CA). This was performed in duplicate to test 
hybridization conditions under low and high stringencies conditions. After drying, the 
membranes were prewashed in 0.1 x SSC, 0.5% SDS for 25 min at 60**C. The membranes 

20 were prehybridized and hybridized as previously described for plasmid Southern blots with 
the inclusion of 10% dextran sulfate and 50 microgram/milliliter salmon sperm DNA in the 
hybridization buffer. Washing conditions for genomic southem blots varied depending on the 
stringency tested. One membrane was washed under conditions of low stringency for 15 min 
at room temperature in 2x SSC, 0. 1% SDS followed by second 15 min room temperature 

25 wash in Ix SSC, 0.1% SDS and three final washes at 50**C in Ix SSC, 0,1% SDS„ The 
second membrane was washed under conditions of high stringency by lowering the salt 
concentration to 0.1 x SSC, 0.1% SDS and increasing the temperature to 65**C. The ^^P- 
labeled 1512 bp PERKl cDNA probe was generated by random priming as described by 

♦ 

Feinberg and Vogelstein (1983). Membranes were subjected to autoradiography (XAR-5 
30 film, Kodak) overnight at -80"C. 
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Isolation and Northern Blot Analysis of Multiple Tissue RNA 

Total RNA was extracted from a mixture of Westar and Wl root, stem, leaf, petal, 
anther and pistil tissue as described by Jones et al. (1985). Approximately 40 micrograms of 
total RNA was fractionated on a 1.2% fomialdehyde gel (Sambrook et al., 1989) and 
5 transferred to Zetaprobe membrane (Biorad, Hercules, CA) in lOx SSC. Hybridization and 
high stringency wash conditions were conducted as previously described for genomic 
Southern blot analysis. The membrane was subsequently probed with a ISSrRNA as an 
internal control for even loading of total RNA. 

Plant Treatments 

10 B, napus plants were grown in a growth chamber at 22^C with a 1 6hr light period 

followed by an 8hr dark period at 16**C. Experiments were conducted on two month old 
plants, and all experiments used one plant per time point from which leaf and stem tissue was 
harvested. 

Wounding of leaf material was performed by punching out leaf discs every 1cm 
1 5 around the perimeter of the leaf blade ensuring that the midvein remained intact, and stems 
were wounded by slicing into l-3cm segments. The wounded tissues were placed in petri 
dishes containing filter paper moistened with 20mM sodium phosphate buffer supplemented 
with 50 microgram/milliliter chloramphenicol to prevent bacterial contamination of the 
wounded tissue (Shirsat et al., 1996). A control (0 hr) time point for this experiment was 
20 performed by incubating unwounded tissue in the sodium phosphate buffer. Wounded leaf 
and stem tissue was harvested at different times after woimding (0 hr, 5 min, 1 5 min, 45 min, 
1 hr, 4 hr, 12 hr, 24 hr, 36 hr, 48 hr). The procedures for the other wounding treatments 
assayed are described in detail in the figure legends (Figure 4a-c). 

Plants were thoroughly sprayed with 50 micromolar methyl jasmonate (MeJA; 

25 Sigma, St. Louis, MO) (Titarenko et al., 1997) and 4mM salicylic acid (SA; Sigma, St. Louis, 
MO) solutions (Schweizer et al., 1998). Leaf and stem tissue was harvested at various time 
points (0 hr, 5 min, 15 min, 30 min, 45 min, 1 hr, 2 hr, 4 hr, 12 hr, 24 hr, 36 hr, 48 hr, 72 hr, 
96 hr) following SA and MeJA treatments. Control sprays were performed with the carrying 
solutions in the absence of the chemical inducer. Carrying solutions were 5mM phosphate 

30 buffer, pH 7 for SA, and 0.1% [7v] ethanol for MeJA. The method of inoculation used for 
the fugal treatment is described in the figure caption for Figure 7. 
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Total RNA was extracted from treated tissue according to the method described by 
Cock et al. (1997). Depending on the treatment, varying amounts of total RNA (20-40 
micrograms) was electrophoresed on a 1 .2% formaldehyde and standard Northern blot 
analysis was performed as described by Sambrook et al. (1989). Hybridization and washing 
5 conditions were performed as outlined for the multiple tissue northern blot. Following 
autoradiography, the amounts of radioactive signal were quantified using Instant Imager 
Electronic Autoradiography (Packard, Meriden, CT). The membranes were reprobed with the 
cyclohpilin cDNA and the amoimts of hybridized radiolabeled cyclophilin were quantified in 
the same manner. The relative amoimts of RNA hybridized to the fixU length PERKl cDNA 
10 probe were determined after correction for differences in the amounts of cyclophilin RNA. 

The present invention has been described in detail and with particular reference to the 
preferred embodiments; however, it will be understood by one having ordinary skill in the art 
that changes can be made thereto without departing firom the spirit and scope of the invention. 

All articles, patents and other docim[ients described in this application (including database 
15 sequences and/or accession nimibers), US patent application no. 60/149,466 filed on August 

19,1999 (entitled "Characterization of a Novel Receptor Kinase from Brassica with a 

Putative Role in Plant Defence"), US patent application no. 60/159, 122 and US patent nos. 

5,612,191, 5,763,21 1, 5,750,848 and 5681714, are incorporated by reference in their entirety 

to the same extent as if each individual publication, patent or document was specifically and 
20 individually indicated to be incorporated by reference in its entirety. They are also 

incorporated to the extent that they supplement, explain, provide a background for, or teach 

methodology, techniques and/or compositions employed herein. 
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We claim: 

1 . An isolated nucleic acid molecule encoding a proline-rich, extensin-like receptor kinase 
(PERK) polypeptide, or a fragment of a PERK polypeptide having PERK activity. 

2. The molecule of claim 1, wherein the polypeptide is a signaling molecule associated with 
5 the cell wall via its extensin-like extracellular domain and is involved in the transduction 

of extracellular stimuli into an intracellular response through a cytoplasmic kinase 
domain, thereby bridging the cell wall-plasma membrane continuum. 

3. The molecule of claim 1, wherein the extracellular stimuli includes wounding or pathogen 
attack. 

10 4. The molecule of claim 1 , wherein the wounding is selected from the group consisting of a 
cut, a break, a tear, a fold and an insect wound. 

5. The molecule of claim 3, wherein the pathogen comprises bacterial pathogens, frmgal 
pathogens, Sclerotinia sclerotiorum, Cylindrosporium concertricum, Phoma lingam^ 
Pseudomonas syringae, Streptomyces scabies. Blackleg, Whiterust, Fusarium Head 

15 BUght, Rust, Bunt, Leaf Spot , White mold, root rot, Fusarium ear rot 

6. An isolated nucleic acid molecule encoding a PERK polypeptide, a fragment of a PERK 
polypeptide having PERK activity, or a polypeptide having PERK activity, comprising a 
nucleic acid molecule selected from the group consisting of: 

(b) a nucleic acid molecule that hybridizes to a nucleic acid molecule consisting of 
20 [SEQ ID NO: 1 ], or a complement thereof under low, moderate or high stringency 

hybridization conditions wherein the nucleic acid molecule encodes a PERK 
polypeptide or a polypeptide having PERK activity, 

(b) a nucleic acid molecule degenerate with respect to (a), wherein the nucleic 
molecule encodes a PERK polypeptide or a polypeptide having PERK activity. 

25 

7. The nucleic acid molecule of claim 2, wherein the hybridization conditions comprise low 
stringency conditions of IXSSC, 0.1% SDS at 50^C or high stringency conditions of 
O.IXSSC, 0.1% SDS at 65*^0. 
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8. An isolated nucleic acid molecule encoding a PERK polypeptide, a fragment of a PERK 
polypeptide having PERK activity, or a polypeptide having PERK activity, comprising a 
nucleic acid molecule selected from the group consisting of: 

(a) the nucleic acid molecule of the coding strand shown in [SEQ ID NO: 1 ], or a 
5 complement thereof; 

(b) a nucleic acid molecule encoding the same amino acid sequence as a 
nucleotide sequence of (a); and 

(c) a nucleic acid molecule having at least 17% identity with the nucleotide 
sequence of (a) and which encodes a PERK polypeptide or a poypeptide 

1 0 having PERK activity 

9. The nucleic acid molecule of any of claims 1 to 8, wherein the PERK polypeptide 
comprises a PERKl polypeptide. 

10. The nucleic acid molecule of claim 1, comprising all or part of a nucleotide sequence 
shown in [SEQ ID NO:l] or a complement thereof. 

15 11. The nucleic acid molecule of claim 1, consisting of the nucleotide sequence shown in 
[SEQ ID NO:l] or a complement thereof. 

12. A PERKl nucleic acid molecule isolated from Brassica^ or a fragment thereof 

13. The nucleic acid molecule of claim 12, wherein the Brassica comprises Brassica napus 
Brassica juncea, Brassica rapa or Brassica oleracea. 

20 14. A recombinant nucleic acid molecule comprising a nucleic acid molecule of any of claims 
1 to 4 and a constitutive promoter sequence or an inducible promoter sequence, 
operatively linked so that the promoter enhances transcription of the nucleic acid 
molecule in a host cell. 

1 5. The nucleic acid molecule of claim 1 , wherein the molecule comprises genomic DNA, 
25 cDNA or RNA . 

16. The nucleic acid molecule of claim 13, wherein the nucleic acid molecule is chemically 
synthesized. 

17. An isolated nucleic acid molecule comprising a nucleic acid molecule selected from the 
group consisting of 8 to 10 nucleotides of the nucleic acid molecule of claim 6, 1 1 to 25 
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nucleotides of the nucleic acid molecule of claim 6 and 26 to 50 nucleotides of the nucleic 
acid molecule of claim 10. 

18- The nucleic acid molecule of claim 1, comprising at least 30 consecutive nucleotides of 
[SEQ ID NO:l] or a complement thereof. 

5 1 9. A vector comprising the nucleic acid molecule of any of claims 1 to 4. 

20. The vector of claim 19, comprising a promoter selected from the group consisting of a 
super promoter, a 35S promoter of cauliflower mosaic vims, a chemical inducible 
promoter, a copper-inducible promoter, a steroid-inducible promoter and a tissue-specific 
promoter. 

10 21 . A host cell comprising the recombinant nucleic acid molecule of claim 1 or the vector of 
claim 19, or progeny of the host cell. 

22. The host cell of claim 21, selected from the group consisting of a ftmgal cell, a yeast cell, 
a bacterial cell, a microorganism cell and a plant cell. 

23. A plant, a plant part, a seed, a plant cell or progeny thereof comprising the recombinant 
1 5 nucleic acid molecule of claim 1 5 or the vector of claim 1 9. 

24. The plant part of claim 23, comprising all or part of a leaf, a flower, a stem, a root or a 
tuber. 

25. The plant, plant part, seed or plant cell of claim 23, wherein the plant, plant part, seed or 
plant cell is of a species selected from the group consisting of Brassica napus, Brassica 

20 rapa, Brassica juncea, Brassica oleracea^ or from the family Brassicaecae, Arabidopsis, 

potato, tomato, tobacco, cotton, carrot, petunia, simflower, strawberries, spinach, lettuce, 
rice, soybean, com, wheat, rye, barley, sorgum and alfalfa. 

26. The plant, plant part, seed or plant cell of claim 23, wherein the plant comprises a dicot 
plant. 

25 27. The plant, plant part, seed or plant cell of claim 23, wherein the plant comprises a 
monocot plant. 

28. An isolated polypeptide encoded by and/or produced from the nucleic acid molecule of 
any of claims 1 to 4, or the vector of claim 19. 

29. An isolated PERK polypeptide or a fragment thereof having PERK activity. 
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30. An isolated polypeptide, the amino acid sequence of which comprises at least ten 
consecutive residues of [SEQ ID NO:2]. 

31 . An isolated immunogenic polypeptide, the amino acid sequence of which comprises at 
least 8 consecutive residues of [SEQ ID NO:2]. 

5 32. An isolated polypeptide, the amino acid sequence of which comprises residues 1 to 137, 
138 to 160 and 161 to 648 of [SEQ ID NO:2]. 

33. The polypeptide of claim 29 comprising all or part of an amino acid sequence in [SEQ ID 
NO:2]. 

34. A polypeptide fragment of the PERK polypeptide of claim 29, or a peptide mimetic of the 
1 0 PERK polypeptide. 

35. The polypeptide fragment of claim 33, consisting of at least 20 amino acids, which 
fragment has PERK activity. 

36. The fragment or peptide mimetic of claim 34, which is capable of being bound by an 
antibody to the polypeptide of claim 28. 

1 5 37. The polypeptide of claim 28 which is recombinantly produced. 

38. An isolated and purified polypeptide comprising the amino acid sequence of a PERK 
polypeptide, wherein the polypeptide is encoded by a nucleic acid molecule that 
hybridizes under moderate or stringent conditions to a nucleic acid molecule in [SEQ ID 
NO:l], a degenerate form thereof or a complement. 

20 39. A polypeptide comprising a sequence having greater than 20% sequence identity to the 
polypeptide of claim 33. 

40. The polypeptide of claim 28, wherein the polypeptide comprises a PERK polypqjtide. 

41 . The polypeptide of claim 40, isolated from Brassica, 

42. The polypeptide of claim 41, wherein the Brassica comprises Brassica napus or Brassica 
25 juncea or Brassica rapa or Brassica oleracea. 

43. The polypeptide of claim 39, comprising a kinase domain including at least 30% 
homology to the kinase domain of [SEQ ID NO.:2] and/or an extracellular domain 
including at least 20% homology to the extracellular domain of [SEQ ID NO.:2]. 

44. An isolated nucleic acid molecule encoding the polypeptide of claims 29 or 33. 
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45. An antibody directed against the polypeptide of claim 33. 

46. The antibody of claim 45, comprising a monoclonal antibody or a polyclonal antibody. 

47. An isolated nucleic acid molecule encoding a polypeptide that reduces the severity of 
wounding or pathogen attack in a plant, the polypeptide comprising: 

5 (a) an extracellular domain which recognizes an extracellular binding molecule 

whose level is increased during the woimding or pathogen attack, the extracellular 
domain encoding a plurality of repeats selected from the group consistmg of SPPPP, 
SPP, PP and PPP, wherein a pluraUty of the proline molecules are capable being 
glycosylated and/or hydroxylated; 

10 (b) a membrane domain operably connected to the extracellular domain, wherein 

the membrane domain is capable of extending across a cell membrane from the 
extracellular side of the membrane to intracellular side of the membrane; and 

(c) a cytoplasmic domain operably connected to the membrane domain, wherein 
the cytoplasmic domain comprises a means for producing kinase activity when the 
1 5 extracellular binding molecule interacts with the extracellular domain. 
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ATOTCCTCQOCOCCOTCTCCOOOO 24 
MSSAPSPQ 
25 ACraOTTCGCCTCCATCTCCACCATCAAACTCCACAACCACCACTCCTCCTCCAOCTTCC 84 

TOSPPSPPSM'STTTTPPPAS 
as aCTCCTCCTCCCACCACACCTTCTTCTCCTCCaCCOCCATCCACTATTCCGACATCTCCT 144 

APPPTTPSSPPPPSTtPTSP 
145 CCTCCTTCTTCTCOCTCTACACCTTCTOCTCCTCCTCCATCTCCACCAACTCCATCTACO 204 

PPSSRSTPSAPPPSPPT.PST 
Z05 CCOOOATCTCCACCTCCrrCTTCCTCAOCCOTCTCCACCCOCTCCAACTACOCCCOOATCT 264 

POSPPPLPQPSPPAPTTPOS 
265 CCACCCOCACCTGTTACTCCTCCTACTCOAAACCCTCCACCTTCAOTCCCAOOACCACCa 324 

PPAPVTPP'TRNPPPSVPOPP 
325 TCCAATCCTTCACOCOAAaQAOOATCTCCTCOACCTCCATCTrCTCCCTCaCCOCCOTCT 384 

SHPSREQOSPRPPSSPSPPS 
385 CCTTCTTCCOACQQTTTATCAACAngAnTnnrnnTnnnfcATrnrrATrQQAQQAm-rnn-r 444 

PSSDOLSTOVVVO lAIOOVA 
CTQCTT (TTn AT A nrn AtriCTQATTTflTtnrPFTPTfrr AAflA Afl aa Arn* rnM.nh.nM. n n> > 5Q4 

LLVIVTLICLLCRKKRRROE 
505 OAAOATaCTTACTATOTTCCTCCaCCACCTCCTCCTOtrrCCCAAAOCCOOAOOACCTTAC 564 

EDAYYVPPPPPPaPKAOOPy 
565 aOTOOACAaCAOCAACAATGOCOOCAACAAAACOCAACACCACCaTCAOATCATOTCOTO 624 

GOQQQQWRQQHATPPSDHVV 
625 ACOTCACTACCACCACCACCTAAOOCTCCATCTCCACCACGGCAACCTCCTCCACCTCCA 684 

TSLPPPFRAPSPPRQPPPPP 
685 CCACCGCCTTTCATOAaCAOCAOCGGCaOCTCCOACTACTCOOACCGTCCAOTrCTTCCT 7 44 

PPPFMSSSOOSDYSDRPVLP 
7 45 CCACCOTCTCCAOOOCTrGTGrrAOOCTTCTCCAAAAGCACTTTCACATACOAOOAOCTA 804 

PPSPOLVLOFSKSTFTYEEL 
805 OCTAGAOCCACCAATOaTTTCTCCOAaaCOAACTTaTTAOOACAAOOCOOOTTCOOTTAC 864 

A R A T H *a^„t._SL-JE-,„A.^^.-k-^t .P,,JL.,Q_^ -9-.^f-. 0,jr 
865 OTOCACAAAOOTOTOTTOCCTAGTOOOAAAOAAOTTOCTOTOAAOCAOTTOAAAOTTOao 924 

925 AOTaaTCAOGOAOAOAOGGAOTTTCAOGCAOAOOTTOAOATCATCAGCAQAGTTCACCAC 984 

S — Q *"Q_it__E._J__i_.E Q..^-. .E-._ .V E„J l-_5-JL_.X^'^« H 

985 AOOCATCTOOTOTCTCTTaTTOGTTATTGCATCaCCOOTGCCAAAAGATrOCTTaTCTAT 

R-JH — L..-.^.^^._U.-Jt-_Q_„X_.C__I A "Q-.-A^-B R^L L _v Y 

1045 OAOTrrOTTCCTAACAACAATCTCO AGCTTCACCTCCATGOCGAOGOACGGCCTACAATO 

I IDS OAATaOAOCACCAOATTOAAOATTOCTCTTOaATCTOCTAAAOOACTrTCTTATCrrCAT 

E.-Jflf — S-^T™R._JL„.K__J[ A ^^^r . Y _ J.. .H 

1165 O AAOATTOCAATCCTAAAATCATTCACCOTGATATCAAGGCTTCAAAC ATATTOATAGAT 

E__D C H P K T T^PJ^^ ^ ^ ^ "JL' ' 
12 25 TTC AAGTTTOAAaCTAAOGTTOCTG ATTTTOOTCTTQCT AAO ATTO*CITCTOATAC A^ 

1285 ACOCATOTATC AACACGTOTGATOGOAACCTTTOOOTACTTOOCTCCOOAATACOCTOCA 
T_JjL_v ^'"^^_..a-_..V„.ijL ^Q T F o f A p F^_x "A. _A 

1345 AOCGGAAAOCTCACGOAOAAOTCTOACGTTTTCTCATTTOOCOTTGTGCTTTTGOAOCTC 

1405 ATTACTGOACOTCaACCCaTTGATfedAACAATOTCTATOTAOATOACAOCTTAOTTOAC 1464 

i_jr — a__B — .z ... y ,_o a_._h ^» *v„„y,_y p, p _ s l _ y _ 

14 65 TOGGCACOACCATTGCTTAACCGAaCATCTOAOCAAGGAOACTTTOAGOOTTTAGCTOAT 1524 

— B__P — I L_R_R .A S-.,E. ..Q 0.. ,P,-.E E Q - L A Q 

1525 aCAAAQATOAATAATGGGTATGACAOAOAaOAGATGaCTCaCATGGrTOCTTGTOCTGCG 

A-_Jt._Jl_a.Jli_(L.,JC-^0_a™_£ »'E^.M- _A _ R W V A C A ^ 
1585 OCTTOTOTTCOCCATTCAGCTCOCCGCAOACCTCOCATOAGCCAGATTaTOCOTOCGTrA 

A — C-___M — R_JHL-_5. .^-_JR R-^-P S— II — S^. Q 1 V B__A_JU 

1645 GAAGOAAATQTATCACTOTCAOATCTTAACOAAOGOATOAOACCAGQTCAAAOCAATaTA 1704 

EGM*VSLS0LHEOMRPGQSHV 
1705 TACAQCTCATACGGAOOAAGCACCOATTATOACTCGAGCCAOTACAATOAAOACATOAAO 

YSSYGOSTDYDSSQYHEDMK 
1765 AAQTTTAGOAAAATaOCACTTOGAACTCAAGAOTACAACQCCACOGGTGAOTACAaTAAT 1824 

KF RRMALOTQEyN*ATGEYS|I 
1 8 25 C CGACC AGTaACTATGOACTOTACC CGTCTOaTTCAAOCAOCGAGGGCCAAACCACACGC 

PTSDYQLYPSGSSSEGQTTR 
1885 QAAATOOAGATOOOGAAQATTAAGAOAACCQGTCAaQOTTATAGTGQACCTTCTCTTTAA 

EMEMGKI RRTOQOYSGPSL • 
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All have no signal peptide and predicted 
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TAGAAAAAAA AA AATGTCAG ACTTAGGCGA GTCGCC GAGT TCTTCACCAC CAGCACCACC 
AGCTGATACC GCTCCTCCAC CAGAGACTCC ATC AGAAAAC TCAGCTCTTC CACCTGTTGA 
TTCCTCTCCT GCTAGTCCAC CAGCTGATTC AT CATCAACA CCGCCGCTGT CAGAACCATC 
CACTCCTCCT CCAGATTCAC AGCTTCCTCC TTTACCTTCG ATTCTTCCTC CGCTAACAGA 
TTCTCCACCT CCACCTTCCG ATTCTTCTCC ACCCGTTGAT TCAACCCCTT CTCCGCCGCC 
ACCX3ACGTCA AACGAATCTC CTTCTrgTCC AGAAGATTCC GAAACACCAC CTGCTCCACC 
AAATGAATCC AATGACAACA ACCGTCCTCC GTCTCAAGA T CTTCAATCGC CTCCTCCATC 
GTCGCCGTCG CCGAATGTAG GACCCACAAA CCCGGAATCA CCACC GTTAC AATCTCCTCC 
AGCTCCACCA GCATCAGATC CTACAAATTC ACCG CCAGCT TCACCATTAG ACCCTACCAA 
TCCTCCCCCA ATACAACCAT CAGGACCAGC CACTTCTCCT CCGGCTAATC CCAACGCTCC 
GCCGAGCCCA TTCCCCACAG TACCACCCAA AACTCCTTCT AGTGGACCTG TGGTGTCTCC 
ATCTCTCACA TCCCCTAGTA AAGGAACTCC TAC TCCAAAC CAAGGCAATG GAGATGGCGG 
TGGCGGTGGT GGCGGCTATC AAGGGAAGAC TATGGTTGG T ATGGCTGTAG CCGGTTTCGC 
AATCATGGCG CTTATAGGCG TTGTGTTCTT AG TGAGAAGA AAGAAAAAGA GAAACATTGA 
TAGCTATAAT CACTCACAGT ACTTGCCACA TCgCAATTTC TCTGTTAAAT CAGGTTTAAA 
AATCTCACCT TTATCTCTCT CTGATCATCT TCTATGTGCT TGAATCATCT CTCTGACTAT 
CTTTGCTTTT GATGTAG ATG GATTCTTATA CGGT CAAGAT CGAGGTAAAG GATACTCCTC 
TGGTCCTAAT GGTTCAATGT ATAACAATTC ACAGCAACAA CAATCCTCTA TGGGAAACAG 
TTATGGTACA GCTGGTGGTG GTTATCCTCA TCATCAA ATG CAATCAAGTG GCACACCTGA 
CTCTGCTATA CTCGGAAGTG GCCAGACTCA TTTCAGT TAC GAAGAGCTTG CTGAGATAAC 
ACAAGGCTTT GCTCGCAAAA ACATTCTTGG AGAAGGCGGA TTTGGATGTG TCTATAAAGG 
TACATTGCAG GATGGTAAAG TTGTTGCGGT TAAGCAGCTT AA AGCTGGAA GTGGACAAGG 
TGACCGTGAA TTCAAAGCAG AGGTTGAGAT CATCAGC CGC GTTCATCATC GCCATTTGGT 
CTCTCTGGTT GGTTACTGCA TTTCAGACCA GCATAGAT TG CTTATCTATG AGTATGTTTC 
TAATCAAACC TTGGAGCATC ATTTGCATGG TGAGTGACTT GTTACCATTT TCGTTATAGA 
TAAGACTTTT TTTTAGCTTT ACGTGTTAGA CTGACTCGCT TTACGCTTTA GGAAAGGGTT 
TGCCAGTTTT A GAGTGGTCT AAGAGAGTCC GGAT CGCTAT AGGATCAGCC AAAGGGTTGG 
CATATCTTCA CGAAGACTG T AAGTAATGCC TTCACATTTT CTTAGTTGTG TGCTTTGGTT 
ATGCACTTCA TAGTTTAAAC AGAAGCCAAA AATCATATCC TTGTTTTATT TTACAGGT£A 
TCCGAAAATC ATTCACAGAG ATATAAAGTC AGCAAA TATT CTTCTAGATG ATGAATATGA 
AGCTCAGGCA ATAATGAAAT CCTCCTTTTC GTTAAATC TA TCTTATGACT GTAAAGTTTT 
AGTTAATGAG ACTTGTTCTG TTTTTTGGAT GTTTAG GTTG CTGATTTTGG ACTTGCTAGA 
CTCAATGATA CAACACAAAC TCATGTTTCA A gTCGGGTTA TGGGAACCTT CGGGTAAGCA 
AACATTCATC ACAAACTCTA CTCCAAAACT GGACCTTATT GATCCAATGC CTGATGAAAA 
GTTTGTTATA TATGGCTTGA GGCAACAAAT TGGATCAAAC CTGAATCTTT ATTGATCGTA 
TGGCTGCATG ACATGTTTTG TfrTTAA GGTA CCTAGC GCCG GAATATGCAT GAAGTGGAAA 
ATTGACTGAT AGATCCGATG TATTCTCATT CGGGGTTGTT CTC TTAGAGC TTGTAACTGG 
ACGGAAACCA GTTGACCAGA CTCAGCCTCT AGGA GAAGAG AGTTTGGTTG AATGGGTAAG 
AATCCAACTT TCAAACATTC TTCAATAATA GTAAGATTGG CCCTAGTATA CTTATATAGT 
ACTTATAAAT GAACTCACAG GCGCGCCCGC TGCTTCT CAA AGCCATTGAG ACCGGAGATT 
TAAGCGAACT GATTGATACA CGGCTTGAAA AGCGTTATGT GGAGCATGAA GTCTTCAGAA 
TGATCGAGAC AGCCGCTGCA TGTGTTAGAC AT TCTGGTCC AAAACGCCCA CGCATGGTTC 
AGGTAATTCT GACTAACCAA AAGTCCAAAG CTCCCATATA TAGTAACAAG TGATTTCTCA 
CATCTGAAAA CTTATCTACT CTTCGAAATA A GGTTGTGA G AGCATTGGAC TGCGACGGAG 
ACTCGGGAGA TATTAGCAAC GGAATCAAAA TTQGGCAAA G CACAACTTAT GACTCAGGGC 
AATACAATGA AGACATTATG AAATTCAGGA AAATGGCGTT TGG TGGTGAT AACAGCGTAG 
AGTCAGGATT GTACAGTGGA AACTACTCTG CCAAAAG CTC TTCAGATTTC TCAGGGAATG 
AATCTGAGAC TCGGCCTTTC AACAACCGAC GGTTCTGAT C ATACAATAGG TGAAAGTAAC 
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721 GGC TAT CAA GGC AAC ACT ATG GTT GGT ATG GCT GTA GCC GGT TTC CCA ATC ATS GCG CTT 
261 r G V V F L V RRKKKRNIDSYNH 
781 ATA GGC CTT CTC TTC TTA GTG ACA AGA AAG AAA AAG ACA AAC ATT CAT AGC TAT AAT CAC 
281 SQYLPH PMFSVKSOGFLyGQ 
841 TCA CAG TAC TTG CCA CAT CCC AAT TTC TCT GTT AAA TCA GAT GGA TTC TTA TAC GCT CAA 
301 DPGKGYSSGPNGSMYNNSQQ 
901 GAT CCA GGT AAA GGA TAC TCC TCT GGT CCT AAT GGT TCA ATG TAT AAC AAT TCA CAC CAA 
321 QQSSMGNSYGTAGGGYPKHQ 
961 CAA CAA TCC TCT ATG GGA AAC AGT TAT GCT ACA GCT GGT GGT GGT TAT CCT CAT CAT CAA 
341 MQSSGTPDSAX LGSGQTHFS 
1031 ATS CAA TCA AGT GGC ACA CCT GAC TCT CCT ATA CTC GGA AGT CCC CAG ACT CAT TTC AGT 

361YBELAE ITOGPARKNILGBG 
1081 TAC GAA GAC CTT GCT CAG ATA ACA CAA CCC TTT GCT CGC AAA AAC ATT CTT GGA GAA GGC 

381 GFGCVYKGTLODCKVVAVKO 
1141 GGA TTT GGA TGT CTC TAT AAA GGT ACA TTC CAG GAT GGT AAA CTT GTT CCG GTT AAC CAG 

401 LKAGSCQ60REFKAEVEZ 15 
1201 CTT AAA GCT GGA AGT GGA CAA GGT GAC OCT GAA TTC AAA GCA GAC GTT GAS ATC ATC AGC 

421 RVKKRHLVSLiVGyCISDQHR 
1261 CGC CTT CAT CAT CGC CAT TTC CTC TCT CTC GTT GCT TAC TCC ATT TCA GAC CAG CAT AGA 

441 LLIYEYVSMQTLEHHLHKWS 
1321 TTG CTT ATC TAT GAG TAT GTT TCT AAT CAA ACC TTC GAG CAT CAT TTG CAT GAC TCG TCT 

461KRVRIAIGSAKGLAYLHEDC 
1381 AAG AGA CTC CGG ATC GCT ATA GGA TCA GCC AAA GGC TTG GCA TAT CTT CAC GAA GAC TGT 

481 HPKIIHRDIKSANILLODEY 
1441 CAT CCG AAA ATC ATT CAC AGA GAT ATA AAG TCA GCA AAT ATT CTT CTA CAT GAT GAA TAT 

S01EAQAIMK$5FSLN1.SY0CKV 
1501 GAA GCT CAG GCA ATA ATG AAA TCC TCC TTT TCG TTA AAT CTA TCT TAT GAC TGT AAA CTT 

521 LVAOFGLARLNDTTQTHVS T 
1561 TTA GTT GCT GAT TIT GGA CTT GCT AGA CTC AAT GAT ACA ACA CAA ACT CAT GTT TCA ACT 

541 RVMGTFGYLAPEYASSGK LT 
1631 CGC GTT ATC GGA ACC TTC CGC TAC CTA GCG CCG GAA TAT GCA TCA AGT GGA AAA TTG ACT 

561 DRSDVFSFGVVLLCLVTGRK 
1681 GAT AGA TCC GAT GTA TTC TCA TTC CGG GTT GTT CTC TTA GAG CTT CTA ACT GGA CGG AAA 

581 PVOOTQPLCBESLVeWARPL 
1741 CCA CTT GAC CAG ACT CAG CCT CTA GGA GAA GAG ACT TTG CTT GAA TGG GCG CGC CCG CTG 

601 LLKAIETGDLSELIDTRLEK 
1801 CTT CTC AAA GCC ATT GAG ACC GGA GAT TTA ACC GAA CTC ATT GAT ACA CCG CTT GAA AAC 

621 RYVEHEVPRMIBTAAACVRH 
1861 CCT TAT GTG GAG CAT CAA GTC TTC AGA ATS ATC GAC ACA CCC CCT CCA TGT GTT AGA CAT 

641 SGPKRP RHVQVVRALDC06D 
1921 TCr GGT CCA AAA CGC CCA OGC ATG GTT CAG GTT GTG AGA GCA TZG GAC TGC GAC GGA GAC 

661SG0rSKGlKlCQSTTYDSCQ 
1981 TCC GGA CAT ATT AGC AAC GGA ATC AAA ATT GGG CAA AGC ACA ACT TAT GAC TCA CGG CAA 

681 YNEDIMKFRKMAFGGDNSVE 
2041 TAC AAT GAA GAC ATT ATG AAA TTC AGC AAA ATG GCG TTT GGT GGT GAT AAC ACC GTA GAG 

701 SGLYSGWYSAKSSSDFSGME 
2101 TCA GGA TTC TAC ACT GGA AAC TAC TCT GCC AAA AGC TCT TCA GAT TTC TCA GGG AAT GAA 

721 SETRPFHNRRF* 
2161 TCT GAG ACT CGC CCT TTC AAC AAC CCA CGG TTC TGA 
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GAAAATTTTG ATCTCCG ATG GCTTCTTCTC CTGAATCTGC TC CTCCAACA AACTCCACCT 
CTTCTCCATC TCCACCGTCT AATACCAATT CAACCACCT C TTCTCCGCCG GCTCCGTCTC 
CTCCTTCTCC TACACCTCCT CAAGGAGACT CATCATCAT C GCCACCTCCT GATTCCACAT 
CTCCACCAGC TCCACAAGCT CCTAACCCTC CTAATTCCTC TAATAA CTCT CCTTCCCCTC 
CGTCACAGGG CGGTGGAGGA GAAAGAGGAA ATGGAGGAA A CAATGGTGGC AATGATACTC 
CACCGTCACG CGGCTCTCCT CCTTCTCCTC CTTCTAGGAG TAAT GGAGAT AATGGTGGTA 
GCAGATCATC GCCACCAGGA GACACTGGAG GCTCTCGCT C AGACAACCCT CCTTCTAGCG 
GAGGAAGCAG TGGAGGAGGT GGAGGTGGAA GAAGTAATAC GAAT ACAGCG ATCATAGTTG 
GTGTATTAGT CGGAGCTGGA CTTTTGATGA TCGTTCT TAT TATTGTGTGT CTTAGACGCA 
AAAAGAAGAG AAAAGACTCC TTCTACCCTG AACCCATGAA AGG TAAAAAC ATATACACAC 
TCTTATGTTT CAACAAATAA GAAGCTTAGA TTCTTTCATA AAATTTCAGG AAACCAATAT 
CAATACTATG GAAACAACAA CAACAACAAT GCTTCACA GA ATTATCCGAA TTGGCACCTA 
AATTCACAAG GCCAAAACCA ACAATCTACT GGTGGTT GGG GAGGCGGTGG ACCATCACCG 
CCTCCTCCTC CGCGGATGCC TACAAGCGGA GAAGATTCTT CCAT GTACTC AGGCCCATCA 
CGCCCAGTTT TACCTCCTCC TTCGCCTGCT CTAGCCCTCG GATTCA ACAA GAGCAc i i i I 
ACTTACCAAG AGCTTGCGGC TGCAACAGGA GGGTTT ACGG ATGCTAACCT TTTGGGACAG 
GGAGGATTTG GGTATGTCCA TAAAGGAGTC TTGCCTAGCG GGAA AGAAGT AGCAGTTAAG 
AGTTTAAAAG CGGGTAGCGG ACAAGGAGAG AGGGAGTTTC A AGCTGAGGT CGATATCATT 
AGCCGTGTGC ATCATCGGTA TCTTGTTTCT TTGGTTGGAT ATTGCA TAGC TGATGGACAG 
AGGATGTTGG TTTATGAGTT TGTTCCTAAC AAAACTTTGG AATATCA TCT TCATGGTTAG 
ACCACTTAAA AACTTTGAGT ACTAAGTTTA TTTTCTCTAA TCTATATATT CAAGAAAGTT 
GTAACCTTAA TTTGTTGTTG TAG GGAAAAA TCTTCCGGTA ATG GAGTTCT CCACTAGGTT 
GCGTATCGCC TTAGGTGCTG CGAAAGGACT CGCTTACCTT CACGAAGACT GTAAGTTTTA 
ACATTCACCA TTCTCATTTT CTTAACCAAG TTGCATAAAA CA6AGAAAGC TCTGTCTCTG 
ACTAGTGTTA TCTTTTTGGC TGAGAAAATG GTGC!AGGCCA TCCTCGGATC ATTCACCGCG 
ACATCAAGTC TGCAAATATT CTCTTGGACT TCAACTTTGA TGC TATGGTG ATAAACTAGT 
AGCTTGCATT CATCTACGGT TTTTTGTTAA GACTACATTG ATGACATTTT GCATTTGTTT 
ATTCAG GTGG CTGATTTTGG ATTAGCTAAG TTAACATCTG ATA ACAACAC TCATGTATCT 
ACTCGTGTGA TGGGAACTTT CGGG TAAGCG TTTTACCGTA TGATAAGATT GTTCGTGACA 
CTCAAGAAAC ATAACCTTTG TAGACTAATC TACTTTGCTT CTTTCCACAA ACATGTGTAG 
ATATCTAGCT CCAGAATATG CTTCAAGCGG TAAATTAACC GAGAA ATCCG ATG'XTTTCTC 
TTACGGAGTT ATGTTATTGG AACTTATAAC TGGAAAACGA C CGGTTGATA ATAGCATCAC 
CATGGACGAC ACCTTAGTAG ATTGGG TATT CATGCATGTA ACATATGTAT CGTGTATATA 
TGTTTTTCGC CTTTTTCGCG TACTAATGAT CATGAATACA GGCTCGGCCT CTTATGGCTC 
GCGCGCTAGA AGATGGAAAC TTTAATGAGC TCGCAGATGC GAGGCTTG AA GGCAACTACA 
ACCCGCAAGA AATGGCTCGA ATGGTGACTT GTGCCGCTGC TAGCA TTCGT CATTCGGGGC 
GTAJACGTCC AAAGATGAGC CAGG TGAATC AAAATTATAA CTAAAAGTCT ATTTTTGTCA 
GAGAATAACA AACAAATGTT GTGGTTTTCA G ATAGTAAGA GCGTTA GAAG GAGAAGTGTC 
CTTAGATGCT TTAAACGAAG GTGTGAAGCC AGGACACAGT AA CGTTTACG GGTCATTGGG 
AGCAAGCTCG GATTATAGTC AGACATCTTA CAATGCAGAC AT GAAGAAAT TCAGACAGAT 
AGCTTTGTCG AGCCAAGAAT TCCCAGTCAG .TGACTGTGAA GGAA CATCTA GTAATGATTC 
TAjGAGATATG GGAACTAAGA GCCCTACTCC TCCAAAATGA GATCGAATCA ATGATTCTGT 
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M*.SSPSSAPPTNSTSSPSPP 
ATG GCT TCT TCT CCT GAA TCT GCT CCT CCA ACA AAC TCC ACC TCT TCT CCA TCT CCA CCG 

SNTNSTTSSPPAPSPPSPTP 
TCT AAT ACC AAT TCA ACC ACC TCT TCT CCG CCG GCT CCG TCT CCT CCT TCT CCT ACA CCT 

PQGDSSSSPPPDSTSPPAPQ 
CCT CAA GGA GAC TCA TCA TCA TCG CCA CCT CCT GAT TCC ACA TCT CCA CCA GCT CCA CAA 

APNPPNSSNNSPSPPSQGGG 
GCT CCT AAC CCT CCT AAT TCC TCT AAT AAC TCT CCT TCC CCT CCG TCA CAG GGC GGT GGA 

GERGMGGNNGGNDTPPSRGS 
GGA GAA AGA GGA AAT GGA GGA AAC AAT GGT GGC AAT GAT ACT CCA CCG TCA CGC GGC TCT 

PPSPP5RSNGDNGGSRS P P 

CCT CCT TCT CCT CCT TCT AG6 ACT AAT GGA GAT AAT GGT GGT AGC AGA TCA TCG CCA CCA 

GDTGGSRSONPPSSGGS SGG 
GGA GAC ACT GGA GGC TCT CGC TCA GAC AAC CCT CCT TCT AGC GGA GGA AGC AGT GGA GGA 
GGGGRSNTN TAIIVG VLVGA 



GOT GGA GGT GGA AGA AGT AAT ACG AAT ACA GCG ATC ATA GTT GGT GTA TTA GTC GGA GCT 
GLLMIVLIIVC LRRKKKRKD 

GGA CTT TTG ATG ATC GTT CTT ATT ATT GTG TGT CTT AGA CGC AAA AAG AAG AGA AAA GAC 
SFYPEPMKGNQYQYVGNNNH 

TCC TTC TAC CCT GAA CCC ATG AAA GGA AAC CAA TAT CAA TAC TAT GGA AAC AAC AAC AAC 
NNASQNYPNWHLNSQGQMQQ 

AAC AAT GCT TCA CAG AAT TAT CCG AAT TGG CAC CTA AAT TCA CAA GGC CAA AAC CAA CAA 
ST GGWGGGGPSPPPPPRMPT 

TCT ACT GGT GGT TGG GGA GGC GGT GGA CCA TCA CCG CCT CCT CCT CCG CGG ATG CCT ACA 
SGEDSSMYSGPSRPVLPPPS 

AGC GGA GAA GAT TCT TCC ATG TAC TCA GGC CCA TCA CGC CCA GTT TTA CCT CCT CCT TCG 

PALALGFHKSTFTYQELAAA 
CCT GCT CTA GCC CTC GGA TTC AAC AAG AGC ACT TTT ACT TAC CAA GAG CTT GCG GCT GCA 

TGGFTDAMLLGQGG F G Y V H K 
ACA GGA GGG TTT ACG GAT GCT AAC CTT TTG GGA CAG GGA GGA TTT GGG TAT GTC CAT AAA 

GVUPSGKEVAVKSLKAGSGQ 
GGA GTC TTC CCT AGC GGG AAA GAA GTA GCA GTT AAG AGT TTA AAA GCG GGT AGC GGA CAA 

GEREFQAEVDI ISRVHHRYL 
GGA GAG AGG GAG TTT CAA GCT GAG GTC GAT ATC ATT AGC CGT GTG CAT CAT CCG TAT CXT 

VSLVGYCIAOGQRMLVYEFV 

GTT TCT TTG GTT GGA TAT TGC ATA GCT GAT GGA CAG AGG ATG TIG GTT TAT GAG TTT GTt 
PMKTLBYHLHGKMLPVMEFS 

CCT AAC AAA ACT TIG GAA TAT CAT CTT CAT GGG AAA AAT CTT COG GTA ATG GAG TTC TCC 
TRLRIALGAAKGLAYLHEDC 

ACT AGG TTG OCT ATC GCC TIA GGT GCT GCC AAA GGA CTC GCT TAC CTT CAC GAA GAC TCC 
HPRIIHRDIKSAKILLDFNF 

CAT CCT CGG ATC ATT CAC CGC GAC ATC AAG TCT GCA AAT ATT CTC TTG GAC TTC AAC TTT 
DAMVADFGLAKLTSDNNTHV 

GAT GCT ATG GTS GCT GAT TTT GGA TTA GCT AAG TTA ACA TCT GAT AAC AAC ACT CAT GTA 

STRVMGTFGYLAPEYASSGK 

TCT ACT CGT GTG ATG GGA ACT TTC GGA TAT CTA GCT CCA GAA TAT GCT TCA AGC GGT AAA 

LTEKSDVFSYGVMLLELITG 
TTA ACC GAG AAA TCC GAT GTT TTC TCT TAC GGA GTT ATG TTA TTG GAA CTT ATA ACT GGA 

KRPVDNS ITMDOTLVDWA R P 
AAA CGA CCG GTT GAT AAT AGC ATC ACC ATG GAC GAC ACC TTA GTA GAT TGG GCT CGG CCT 

LMARALBOGNFMELADARLE 
CTT ATG GCT CGC GCG CTA GAA GAT GGA AAC TTT AAT GAG CTC GCA GAT GOG AGG CTT GAA 

GMYNPQEMARMVTCAAAS IR 

GGC AAC TAC AAC CCG CAA GAA ATG GCT CGA ATG GTG ACT TGT GCC GCT GCT AGC ATT OCT 
HSGRKRPKMSQIVRALEGBV 

CAT TCG GGG CGT AAA CGT CCA AAG ATG AGC CAG ATA GTA AGA GCG TTA GAA GGA GAA GTG 

SLOALMEGVKPGHSNVYGSL 
TCC TTA GAT GCT TTA AAC GAA GGT GTG AAG CCA GGA CAC AGT AAC GTT TAC GGG TCA TTG 

GASSDYSQTSYNADMKKFRQ 
GGA GCA AGC TCG GAT TAT AGT CAG ACA TCT TAC AAT GCA GAC ATG AAG AAA TTC AGA CAG 

lALSSQEFPVSDCBGT _S_ S N D 
ATA GCT TTG TCG AGC CAA GAA TTC CCA GTC AGT GAC TGT GAA GGA ACA TCT AGT AAT GAT 

SRDMGTKSPTPPK* 
TCT AGA GAT ATG GGA ACT AAG AGC CCT ACT CCT CCA AAA TGA 
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TCCACCGTTT GAGAAACCCT AATAACAACA TTCAAA ATGG CGGACTCACC GGTGGATTCA 
TCTCCTGCCC CTGJUUVCCTC AAATGGGACA CCACCGTCAA ACGGAACATC GCCGTCTAAT 
GAGTCATCGC CGCCAACACC AC C T TCTT CA CCACCACCAT CATCAATATC TGCTCCTCCG 
CCAGATATCT CCGCTT CT T' l ' TTCACCGCCG CCTGCACCAC CAACGCAAGA AACGTCACCT 
CCTACATCTC CGTCCTCATC GCCGCCTGTT GTAGCTAATC CGTCACCGCA GACTCCAGAG 
AATCCTTCTC CACCTGCACC TGAAGGCTCA ACTCCTGTAA CGCCACCTGC ACCACCACAA 
ACACCGTCGA ACCAATCACC GGAAAGACCA ACTCCTCCTT CTCCTGGTGC CAATGATGAC 
CGAAACAGAA CCAATGGCGG AAACAACAAC AGAGACGGCT CCACACCATC ACCACCGTCG 
TCAGGGAACA GAACTTCCGG TGACGGTGGC TCACCTTCAC CACCTCGGTC GATAAGCCCT 
CCTCAG7LATA GTGGAGATTC AGACTCATCA TCGGG TAATC ATCCACAAGC CAACATTGGA 
TTGATTATTG GAGTCCTTGT AGGAGCAG GG (JI ' T T T GCTTC TACTTGCAGT GTGTATTTGC 
ATCTGTTGCA ACAGGAAGAA GAAGAAGAAA TCTCCTCAGG TCAACCACAT GCACTACTAC 
AATAACAATC CTTATGGAGG AGCACCCTCA GG TAATTACA GTTTAGTATA ACTGGAATTT 
AATTTGTAGC CTAATGGTGT TTGATTAGGT TTCAGAACGA TCATAGTCTA ATGGTTTCTG 
CTAGCTCCAT ATGGCAAAAG GATTAGATTT ATAAGCTAAA GGAGATGTTG CATAGTGTAG 
GTAATGGTGG TTATTACAAG GGAACACCTC AAGATCATGT GGTGAATATG GCTGGTCAAG 



GAGGTGGGAA 


TTGGGGTCCA CAGCAACCTG TGTCTGGTCC TCACAGTGAT GCTTCCAACT 


TAACCGGTCG 


AACTGCTATA CCGTCACCTC AAGCTCCAAC TCTTGGTCAC AACCAAAGCA 


CTTTCACATA 


CGATGAACTG TCCATTGCAA CAGAAGGTTT CGCTCAGTCA AATTTGCTAG 


GACAAGGAGG ATTTGGGTAT GTTCATAAAG GAGTTCTGCC TAGTGGCAAA GAAGTTGCAG 


TGAAGAGTCT 


TAAACTTGGA AGTGGACAAG GGGAACGCGA GTTTCAAGCA GAGGTTGATA 


TCATTAGCCG 


TGTCCATCAT CGTCATCTCG TTTCTCrrGT TGGATATTGC ATCTCTGGTG 


GTCAAAGACT 


TTTGGTTTAT GAGTTTATAC CTAACAACAC TCTTGAATTT CATCTTCATG 


GTACATTCAT 


CTAACAGAAT UTTTlCiiUl ATTAACAAAA CCTTTAAGTA TGGTTTCTCT 


TTAATCAGGA 


ACATGATTGA AATTTCAGGA AAGGGTCGTC CGGTTTTGGA TTGGCCTACA 


AGAGTGAAGA TTGCATTGGG ATCAGCTAGA GGCCTTGCAT ATTTGCATGA AGACTGTAAG 



AAAATCTTTA TCTCACATAT TTGCATCAG T TTCTATCTCG CTCTCTACAA TATTT6AAAG 
ATTGTATATG TTACATCAAT TATAG GTCAC CCTCGCATTA TCCACAGAGA TATCAAAGCT 
GCAAACATTC TT Crr GATTT CA GriTrG AG ACCAAGG TAT GTGTGTATAT ATCGACTCTT 
GTACTACTTT TACTTTCATT GTCTCTCATT TTTGTTTCCA ATCTGTGTCG ATGTGTGTAT 
CAGTCTTATT GTGTAAATAT ATGCAG GTGG CAG A ' i ' T ' r ' I 'GG ATTGGCTAAG CTATCTCAAG 
ACAACTATAC TCATGTCTCC ACTCGCGTCA TGGGAA Cm ' TGGG TAAGCA GCTTTGTAAA 
ATGTCTCAAC TCATCCACAC TTATTTAGTT TCTTTCACTT GTTTTTAACA TTTTCTTGGA 
TTCAG ATACT TAGCTCCAGA GTATGCATCA AGCGGAAAGT TATCCGACAA ATCrTGATGTT 
TTCTCATTTG GAGTAATGCT TCTTGAGCTC ATAACCGGAA GACCTCCTCT GGATCTAACT 
GGAGAAATGG AAGATAGCTT GGTAGATTGG GTAAGTCGGT CCCCGCCTCT TCGGTTTACT 
TGTTTAATCC CAAAACACTT TCCAAAGCAA AAACAGAAAC AAATCTTACT ATTGTTGTTG 
CA GGCAAGGC CTTTGTGTTT GAAAGCAGCT CAAGATGGAG ATTACAACCA ATTGGCTGAT 
CCACGTCTAG AGCTAAACTA CAGTCATCAA GAGATGGTTC AAATGGCTTC TTGTGCAGCT 
GCAGCAATCA GACATTCAGC AAGAAGACGG CCTAAGATCA GCCAGGTTCA AAAACTCATA 
CCAe-j.-x\in\3 GTTCTATTG T TATATTTTTA CTCACAATTA ATCTTGATGA TAAATGTGAC 
ATACTAATGA ATCTTGAAAC ATGTGTATGG TAAATGAAAA GATTGTACGA GCACTAGAAG 
GAGATATGTC AATGGATGAT CTAAGTGAGG GAACAAGACC AGGACAAAGC ACGTACTTGA 
GCCCCGGGAG CGTGAGCTCA GAGTATGACG CAAGCTCGTA CACGGCAGAC ATGAAAAAGT 
TCAAGAAACT GGCGTTAGAG AATAAAGAAT ATCAAAGCAG TGAATATGGT GGAACAAGTG 
AGTATGGgTT AAACCCTTCT GCTTCAAGTA GTGAAGAAAT GAATAGAGGC TCAATGAAAC 
GCAATCCTCA GCTTTGAA AG AAGAGACAAC ACTTGTCATA ATATTTCAGT TTTCTTCTCT 
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err TTG CTT CTA CTT GCA GTG TGT ATT TGC ATC TGT TGC AAC AGG AAG AAG AAG AAG AAA 
SPQVNHMHYYNNNPYGGAPS 

TCT CCT CAG GTC AAC CAC ATG CAC TAC TAC AAT AAC AAT CCT TAT GGA GGA GCA CCC TCA 
GNGGYYKGTPQDHVVNMAGQ 

GGT AAT GGT GGT TAT TAC AAG GGA ACA CCT CAA GAT CAT GTG GTG AAT ATG GCT GCT CAA 

GGGNWGPQQPVSGPHSDASN 

GGA GGT GGG AAT TGG GGT CCA CAG CAA CCT GTC TCT GGT CCT CAC AGT GAT GCT TCC AAC 
I^TGRTAIPSPQAATLGHNQS 

TTA ACC GCT CGA ACT GCT ATA CCG TCA CCT CAA GCT GCA ACT CTT GGT CAC AAC CAA AGC 
TFTYDELS lATEGFAQSMLL 

ACT TTC ACA TAC GAT GAA CTG TCC ATT GCA ACA GAA GCT TTC GCT CAG TCA AAT TTG CTA 
GQGGFGYVHKGVLPSGKEVA 

GGA CAA GGA GGA TTT GGG TAT GTT CAT AAA GGA GTT CTG CCT ACT GGC AAA GAA GTT GCA 

VKSLKLGSGQGEREFQAEVD 

GTC AAG ACT CTT AAA CTT GGA ACT GGA CAA GGG GAA CGC GAG TTT CAA GCA GAG GTT GAT 
I ISRVHHRHLVSLVGYCI SG 

ATC ATT AGC CGT GTC CAT CAT CCT CAT CTC GTT TCT CTT GTT GGA TAT TGC ATC TCT GGT 
GQRLLVYEF IPNNTLE F H L H 

GCT CAA AGA CTT TTG GTT TAT GAG TTT ATA CCT AAC AAC ACT CTT GAA TTT CAT CTT CAT 
GKGRPVLDWPTRVKIALGSA 

GGA AAG GGT CGT CCG GTT TTG GAT TGG CCT ACA AGA GTG AAG ATT GCA TTC GGA TCA GCT 
RGLAYLHEDCKKIFISHICI 

AGA GGC CTT GCA TAT TTC CAT GAA GAC TCT AAC AAA ATC TTT ATC TCA CAT ATT TCC ATC 
SHPRI IHRDIKAANI1.LD F S 

ACT CAC CCT CGC ATT ATC CAC AGA GAT ATC AAA GCT GCA AAC ATT CTT CTT GAT TTC AGT 
FETKVADFGLAKLSQDNYTH 

TTT GAG ACC AAG GTC GCA GAT TTT GGA TTC GCT AAG CTA TCT CAA GAC AAC TAT ACT CAT 
VSTRVMGTFGYLAPEYAS SG 

GTC TCC ACT CGC GTC ATC GGA ACT TTT GGA TAC TTA GCT CCA GAG TAT GCA TCA AGC GGA 
KLSDKSDVFSFGVML L E L I T 

AAG TTA TCC GAC AAA TCT GAT GTT TTC TCA TTT GGA CTA ATC CTT CTT GAG CTC ATA ACC 
GRPPLDLTGEMEDSLVOWAR 

GGA AGA CCT CCT CTC GAT CTA ACT GGA GAA ATC GAA GAT AGC TTC GTA GAT TGG GCA AGG 
PLCLKAAQDGDYNQLADP RL 

CCT TTC TCT TTC AAA GCA GCT CAA GAT GGA GAT TAC AAC CAA TTC GCT GAT CCA CGT CTA 
ELNYSHQEMVQMASCAAAA I 

GAG CTA AAC TAC AGT CAT CAA GAG ATC GTT CAA ATC GCT TCT TCT GCA GCT GCA GCA ATC 
RHSARRRPKMSQVQKLIPUV 

AGA CAT TCA GCA AGA AGA CGG CCT AAG ATC AGC CAC GTT CAA AAA CTC ATA CCA CTT GTT 
GSI IVRALEGOMSMDDLSEG 

GCT TCT ATT ATT GTA CGA GCA CTA GAA GGA GAT ATC TCA ATC GAT GAT CTA ACT GAG GGA 
TRPGQSTYLSPGSVSSEYDA 

ACA AGA CCA GGA CAA AGC ACG TAC TTC AGC CCC GGG AGC GTC AGC TCA GAG TAT GAC GCA 
SSYTADMKKFKKLALENKEY 

AGC TCG TAC ACG GCA GAC ATC AAA AAG TTC AAG AAA CTC GCG TTA GAG AAT AAA GAA TAT 
QSSEYGGTSEYGLNPSAS SS 

CAA AGC AGT GAA TAT GCT GGA ACA AGT GAG TAT GGC TTA AAC CCT TCT GCT TCA ACT ACT 
EEMNRGSMKRNPQL* 

GAA GAA ATC AAT AGA GGC TCA ATC AAA CGC AAT CCT CAG CTT TGA 
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CATACATCAC AAAACGGCAT CGTTTTG ATG TCGCTCTCTC CTTCTTCCTC GgCGGCTCCG 
GCAACTTCCC CGCCTGCCAT GTCATTACCT CCGGCAGATT CCGTACCTGA CACGTCATCA 
CCTCCAGCTC CTC L"! " ! T GTC TCCTCTTCCC CCACCATTGA GCTCTCCTCC GCCGTTGCCT 
TCACCACCGC CTCTCTCCGC TCCCACCGCT TCCCCACCGC CTCTTCCGGT TGAATCCCCA 
CCGTCTCCTC CTATAGAATC ACCACCGCCT CCTCTACTGG AATCACCTCC TCCTCCTCCG 
TTGGAATCTC CATCTCCACC GTCTCCTCAC GTCTCAGCTC CTTCCGGTTC ACCGCCATTA 
CCCTTCCTTC CCGCCAAACC TTCTCCGCCG C C TTCTTCAC CTCCCTCCGA GACAGTTCCG 
CCGGGAAATA CGATTTCTCC ACCACCTCGT TCACTTCCCT CCGAATCAAC CCCGCCGGTG 
AACACAGCTT CTCCTCCACC GCCATCTCCT CCTCGCCGCC GTAGTGGCCC TAAGCCTTCG 
TTTCCTCCTC CCATCAATTC TTCTCCACCA AATCCTTCTC CGAACACTCC GTCACTCCCA 
GAAACTTCTC CTCCACCTAA ACCACCGCTC TCAACGACGC CATTTCCCTC CTCATCCACT 
CCCCCGCCTA AGAAGTCCCC TGCAGCAGTA ACTCTTCCTT TCT'l'l'GGGCC AGCGGGCCAA 
TTACCGGATG GGACCGTAGC ACCTCCTATT GGGCCTGTTA TAGAGCCCAA GACGAGTCCA 
GCCGAATCAA TATCTCCGGG AACGCCACAG CCACTGGTTC CGAAGAGTCT ACCTGTAACG 
ACGTCGTATC ACCGATCATC CGCCGGATTC TTATTTGGCG GTGTAATCGT TGGAGCTCTT 
CTACTAATTC TGTTAGGTCT TCTCTTTGTC TTCTACAGAG CTACCAGAAA TAGAAATAAC 
AACAGCAGCT CTGCTCATCA TCAATCCAAA ACTCCCTCAA AAGG TATAAA CTTTGAGATC 
AATTTGTTTC AGACTGTCAC TTAAAATGTA ATGTAATTGT AAGTTATGAT CTTTTTTGCT 
TGGATCAG TT CAACATCATC GGGGCGGTAA TGCTGGTACG AACCAGGCAC ATGTTATCAC 
AATGCCACCA CCAATCCATG CTAAATATAT ATCTAGTGGA GGTTGTGATA CGAAGGAGAA 
CAATTCTGTT GCGAAAAACA TTTCAATGCC ATCTGGAATG TTCTCCTACG AAGAACTTTC 
AAAAGCAACT GGTGGATTTT CAGAGGAGAA CC'l'TrTGGGA GAAGGCGGTT TCGGATATGT 
TCACAAAGGA GTGTTGAAAA ACGGGACAGA AGTTGCGGTG AAGCAGCTGA AGATTGGGAG 
CTATCAAGGG GAAAGAGAAT TCCAAGCTGA GGTTGACACA ATCAGTAGGG TTCATCATAA 
GCACCTCGTT TCATTGGTTG GTTATTGCGT T7ATGGAGAT AAAAGACTCT TGGTTTACGA 
GTTTGTTCCT AAAGATACCT TGGAGTTCCA CTTGCATGG T AAAATAGATA TATGATTTCA 
TCCTTTTTGA TTTTGTCTTT AGTTCATTAT GTTGAGTATT GTGAGAATAT GTGTTGTAGA 
GAACAGAGGA AGCGTGTTGG AATGGGAAAT GAGGCTCAGG ATTGCTGTAG GAGCAGCAAA 
AGGATTAGCT TATCTTCATG AGG A TT G TGA GTTGTTTCTC TTCATAATGG GAATGACAAT 
GGCCTTTTTT GGGTTTTCGA CACTGATATT GATTTCTGGT GCTTGTAATG CAG GCAGTCC 
AACTATAATT CACCGTGATA TCAAAGCAGC TAATATCCTT CTAGATTCCA AATTTGAGGC 
AAAGG TGATC TGCTCTTTAA TCTTATCAAA GTTTGGTTTT TAGAACAGAG TTTGCCCATT 
TTCGGTTTAT AGTCACACCA TTTTGTTCTA TTTAACAAGA TTACATGTAG CTTAGAGTTC 
TTGTTCTCTT CAG GTCTCTG A CTrT GGACT AGCCAAGTTT TTCTCAGACA CCAATTCATC 
ATTCACTCAT ATCTCTACTC GAGTGGTAGG AACTTTCGGG TAAAACACCA TCCATCCATG 
CTTTATATGT TGTGTCATTG TGTTTAAAAA TATTAATTTA CGGTTTCAAG TTTCAAGTTT 
CAAGTTTCAA GTTTCTTCCT TGTTGTATCG TAAATCCAGA TCAAAAGATT TATTGATTAC 
TAAATGCCTT GTGCACCGTC TATTTGGTAT GCATTTAAAC AG ATACATGG CTCCAGAATA 
CGCGTCCAGT GGTAAAGTAA CTGATAAATC AGATGTATAT TCCTTTGGGG TCGTGCTTCT 
AGAACTCATC ACTGGACGTC CATCAAl l ' TT CGCCAAAGAT TCTTCCACAA ACCAGAGTTT 
AGTAGACTGG GTAAGTCAAA GTACATGATG ATGATGATGA TACCATTAGG TTTTCTTTTT 
CCCACTAGTT ATTAGATGAA AATGAAACTC CAGTATAACT GCAAGCTTGA AACTTCATTA 
GACTGTAAAT TTTGATTATT CCTCAG GCGA GGCCATTGCT TACGAAAGCA ATCTCTGGAG 
AAAGTTTTGA CTTTCTTGTA GACTCAAGGT TGGAGAAGAA TTACGATACA ACTCAGATGG 
CAAACATGGC TGCTTGTGCT GCTGCTTGCA TACGCCAATC AGCTTGGCTT CGGCCTAGAA 
TGAGCCAGG T CTGAGATTTA GTTTAAATAC ATGTATTCGT CCATATATCC GAAAGGAACA 
AGCCTAATCC ATGAATACAT TTATATCTTG AAAAAAACTT GAATAG GTAG TACGTGCTCT 
TGAAGGCGAG GTGGCCCTGA GAAAGGTCGA AGAGACTGGG AATAGCGTGA CCTATAGCTC 
TTCTGAAAAC CCGAATGACA TCACACCACG GTATGGAACA AATAAGAGGA GATTCGACAC 
AGGTTCAAGC GATGGTTACA CTTCAGAATA TGGAGTTAAC CCTTCTCAGT CGAGCAGTGA 
ACATCAACAG GTGAATACTT AGTTCACAGG TTCAATAGGG CAAGTTTCAC CACAATTATT 
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MSLSP^SSPAPATSPPAMSL 
ATG TCC CTC TCT CCT TCT TCC TCC CCC GCT CCG GCA ACT TCC CCG CCT GCC ATG TCA TTA 

PPADSVPDTSSPPAPPLSPL 
CCT CCG GCA GAT TCC GTA CCT CAC ACG TCA TCA CCT CCA GCT CCT CCT TTG TCT CCT CTT 

PPPLSSPPPLPSPPPLSAPT 
CCC CCA CCA TTG AGC TCT CCT CCG CCG TTG CCT TCA CCA CCG CCT CTC TCC GCT CCC ACC 

ASPPPLPVESPPSPPIESPP 
GCT TCC CCA CCG CCT CTT CCG GTT GAA TCC CCA CCG TCT CCT CCT ATA GAA TCA CCA CCG 

PPLLESPPPPPLESP S P P S P 
CCT CCT CTA CTG GAA TCA CCT CCT CCT CCT CCG TTG GAA TCT CCA TCT CCA COG TCT CCT 

HVSAPSGSPPLPFLPAKP P 
CAC CTC TCA GCT CCT TCC GGT TCA CCG CCA TTA CCC TTC CTT CCC GCC AAA CCT TCT CCC 

PPSSPPSETVPPGNTISPPP 
CCG CCT TCT TCA CCT CCC TCC GAG ACA GTT CCG CCG GGA AAT ACG ATT TCT CCA CCA CCT 

RSLPSESTPPVNTAS PPPPS 
CCT TCA CTT CCC TCC GAA TCA ACC CCG CCG GTG AAC ACA GCT TCT CCT CCA CCG CCA TCT 

PPRBRSGPKPSFPPPINSSP 
CCT CCT CGC CGC CGT ACT GGC CCT AAG CCT TCC TTT CCT CCT CCC ATC AAT TCT TCT CCA 

PNPSPNTPSLPETSPPPKPP 
CCA AAT CCT TCT CCG AAC ACT CCG TCA CTC CCA GAA ACT TCT CCT CCA CCT AAA CCA CCC 
USTTPFPSSSTPPPKKSPAA 

CTC TCA ACG ACG CCA TTT CCC TCC TCA TCC ACT CCC CCG CCT AAG AAG TCC CCT GCA CCA 
VTLPFFGPAGQLPDGTVAP P 

GTA ACT CTT CCT TTC TTT GGG CCA GCG GGC CAA TTA CCG GAT GGG ACC GTA GCA CCT CCT 

IGPVIEPKTSPAESISPGTP 
ATT GGG CCT GTT ATA GAG CCC AAC ACG ACT CCA GCC GAA TCA ATA TCT CCG GGA ACG CCA 

Q PLVPKSLPVTTSYHRS S A G 
GAG CCA CTG GTT CCG AAG ACT CTA CCT GTA ACG ACG TCG TAT CAC CCA TCA TCC GCC GGA 
F L F G G VIVGALLLT 1; L G L_ 



TTC TTA TTT GGC GGT GTA ATC CTT GGA GCT CTT CTA CTA ATT CTG TTA GGT CTT CTC TTT 

F_ YRATRMRHNMSSSAHHQ S 

GTC TTC TAC ACA GCT ACC AGA AAT AGA AAT AAC AAC AGC AGC TCT GCT CAT CAT CAA TCC 
KTPSKVQHHRGGNAGTNQAH 

AAA ACT CCC TCA AAA GTT CAA CAT CAT CGC GGC GGT AAT GCT GGT ACG AAC CAG GCA CAT 
VITMPPPIHAKYISSGGCDT 

GTT ATC ACA ATG CCA CCA CCA ATC CAT GCT AAA TAT ATA TCT ACT GGA GGT TCT GAT ACG 
KENNSVAKHISMPSGMPSYB 

AAC GAG AAC AAT TCT CTT GCC AAA AAC ATT TCA ATC CCA TCT GGA ATG TTC TCC TAC GAA 
ELSKATCGFSEBMLLCBGGF 

GAA CTT TCA AAA GCA ACT GCT GGA TTT TCA GAG GAG AAC CTT TTG GGA GAA GGC GCT TTC 
G YVHKGVLKNGTEVA V K Q L K 

GGA TAT GTT CAC AAA GGA GTG TTG AAA AAC GGG ACA GAA GTT GCC GTG AAG CAG CTG AAG 
IGSYQGERBFQAEVOTISRV 

ATT GGG AGC TAT CAA GGG GAA AGA GAA TTC CAA GCT GAG GTT CAC ACA ATC ACT ACQ GTT 
HHKHLVSLVGYCVMGOKRLL 

CAT CAT AAG CAC CTC GTT TCA TIG CTT GCT TAT T6C GTT AAT GGA GAT AAA AGA CTC TTG 
VYEPVPKDTLBFHI.HEMRGS 

GTT TAC GAG TTT CTT CCT AAA GAT ACC TTG GAG TTC CAC TTG CAT CAG AAC AGA GGA AGC 
VLEWEMRLRIAVGAAKGLAY 

GTG TTG GAA TGG GAA ATG AGC CTC AGG ATT GCT CTA GGA CCA GCA AAA GGA TTA GCT TAT 
LHEDCSPTIIHRDIKAANIL 

CTT CAT GAG GAT TCC ACT CCA ACT ATA ATT CAC CCT GAT ATC AAA GCA GCT AAT ATC CTT 
LDSKFEAKVSOFGLAKFFSO 

CTA GAT TCC AAA TTT GAG GCA AAG GTC TCT CAC TTT GGA CTA GCC AAG TTT TTC TCA GAC 
TNSS PTHISTRVVGTFGYMA 

ACC AAT TCA TCA TTC ACT CAT ATC TCT ACT CCA CTG CTA GGA ACT TTC GGA TAC ATG GCT 
PEYASSGKVTDKSDVYSFGV 

CCA GAA TAC GCG TCC ACT GCT AAA GTA ACT GAT AAA TCA GAT GTA TAT TCC TTT GGG GTC 
VLLELITGRPSIFAKD S T N 

GTG err CTA GAA CTC ATC ACT GGA OCT CCA TCA ATT TTC GCC AAA GAT TCT TCC ACA AAC 
QSLVDWARPLLTKAISGESF 

CAG ACT TTA GTA GAC TGG GCC AGG CCA TTG CTT ACC AAA GCA ATC TCT GGA GAA ACT TTT 
D FLVOSRLBKKVDTTQMAHM 

GAC TTT CTT GTA GAC TCA AGC TTG GAG AAG AAT TAC GAT ACA ACT CAC ATC GCA AAC ATC 
AACAAACIRQSAWLRPRMSQ 

GCT GCT TCT CCT GCT GCT TCC ATA CGC CAA TCA GCT TGG CTT CGC CCT AGA ATC ACC CAG 
VVRALEGBVALRKVEETGNS 

GTA GTA CCT GCT CTT GAA GGC GAG GTG GCC CIC AGA AAG CTC GAA GAG ACT GGC AAT AGC 
VTYSSSBNPHDITPRYGTMK 

GTG ACC TAT AGC TCT TCT GAA AAC CCG AAT GAC ATC ACA CCA CGG TAT GCA ACA AAT AAG 
RRFDTGS5DGYTSEYGVMPS 

AGG AGA TTC GAC ACA GCT TCA AGC GAT CCT TAC ACT TCA GAA TAT GGA CTT AAC CCT TCT 
QSSSEHQQVMT* 

CAG TCG AGC ACT CAA CAT CAA CAG GIG AAT ACT TAG 
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As a below named inventor, I hereby declare that 

My residence, post office address and citizenship are as stated below next to my name, 

I beKeve I am the original, first and sole inventor (if only one name is listed below) or an original, 
first and joint inventor (if plural names are listed below) of the subject matter vA^ich is claimed and for 
which a patent is sought on the invention entitled 

Proline-rich Extensin-like Receptor Kinases 



the specification of which 

(check one) 

□ is attached hereto. 

B was filed on 18 August 2000 



as United States Application No, or PCT International 



Application Number PCT/CAOO/0096g 
and was amended on 



(if applicable) 

1 hereby state that I have reviewed and understand the contents of the above identified specification, 
including the claims, as amended by any amendment referred to above. 

I acknowledge the duty to disclose to the United States Patent and Trademark Office all information 
known to me to be material to patentability as defined in Title 37, Code of Federal Regulations, 
Section 1.56. 

I hereby claim foreign priority benefits under Title 35. United States Code, Section Il9(a)-(d) or 
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i 13^1} name of soto or Aral Inventor 
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Sole or rirftt inventor^ signature r ^ Q . , ^ 
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Nancy Silva 
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SEQUENCE LISTING 
<110> GORING, Daphne R. et al . 

<120> PROLINE-RICH EXTENSIN-LIKE RECEPTOR KINASES 

<130> P 25, 762 USA 

<140> US 10/069,304 
<141> 2002-02-19 

<150> PCT/CAOO/00966 
<151> 2000-08-18 

<150> US 60/149,466 
<151> 1999-08-19 

<150> US 60/159,122 
<151> 1999-10-13 

<160> 27 

<170> Patentin Ver. 2.1 

<210> 1 
<211> 1944 
<212> DNA 

<213> Brassica napus 

<220> 

<221> CDS 

<222> (1) . . (1944) 

<400> 1 

atg tec teg gcg ccg tct ccg ggg act ggt teg cct cca tct cca cca 48 

Met Ser Ser Ala Pro Ser Pro Gly Thr Gly Ser Pro Pro Ser Pro Pro 
15 10 15 

tea aac tec aca acc aec act cct cct cca get tec get cct cct cec 96 
Ser Asn Ser Thr Thr Thr Thr Pro Pro Pro Ala Ser Ala Pro Pro Pro 
20 25 30 

acc aca cct tct tct cct ccg ccg cca tec act att ccg aca tct cct 144 
Thr Thr Pro Ser Ser Pro Pro Pro Pro Ser Thr lie Pro Thr Ser Pro 
35 40 45 

cct cct tct tct egc tct aca cct tct get cct cct cca tct cca cca 192 
Pro Pro Ser Ser Arg Ser Thr Pro Ser Ala Pro Pro Pro Ser Pro Pro 
50 55 60 

act cca tct acg ccg gga tct cca cct cct ett cct cag ccg tct cca 240 
Thr Pro Ser Thr Pro Gly Ser Pro Pro Pro Leu Pro Gin Pro Ser Pro 
65 70 75 80 



ccc get cca act acg cec gga tct cca cec gca cct gtt act cct cct 
Pro Ala Pro Thr Thr Pro Gly Ser Pro Pro Ala Pro Val Thr Pro Pro 

85 90 95 



288 
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act cga aac cct cca cct tea gtc cca gga cca ccg tec aat ect tea 336 
Thr Arg Asn Pro Pro Pro Ser Val Pro Gly Pro Pro Ser Asn Pro Ser 
100 105 110 

cgc gaa gga gga tct cct cga cct cca tct tct ccc teg ccg ccg tet 384 
Arg Glu Gly Gly Ser Pro Arg Pro Pro Ser Ser Pro Ser Pro Pro Ser 
115 120 125 

cct tct tec gac ggt tta tea aca gga gtg gtg gtg gga ate gee ate 432 
Pro Ser Ser Asp Gly Leu Ser Thr Gly Val Val Val Gly lie Ala lie 
130 135 140 



480 



528 



672 



720 



gga gga gtc get ctg ctt gtg ata gtg act ctg att tgt etc etc tgt 
Gly Gly Val Ala Leu Leu Val lie Val Thr Leu lie Cys Leu Leu Cys 
145 150 155 160 

aag aag aaa cga egg aga gac gaa gaa gat get tac tat gtt ect ccg 
Lys Lys Lys Arg Arg Arg Asp Glu Glu Asp Ala Tyr Tyr Val Pro Pro 

165 170 175 

cca cct cct cct ggt ccc aaa gee gga gga cct tac ggt gga cag cag 576 
Pro Pro Pro Pro Gly Pro Lys Ala Gly Gly Pro Tyr Gly Gly Gin Gin 
180 185 190 

caa caa tgg egg caa caa aac gca aca cca ccg tea gat cat gtc gtg 624 
Gin Gin Trp Arg Gin Gin Asn Ala Thr Pro Pro Ser Asp His Val Val 
195 200 205 

acg tea eta cca cca cca ect aag get cca tet cca cca egg caa ect 
Thr Ser Leu Pro Pro Pro Pro Lys Ala Pro Ser Pro Pro Arg Gin Pro 
210 215 220 

cct cca cct cca cca ccg cct ttc atg age age age ggc ggc tec gac 
Pro Pro Pro Pro Pro Pro Pro Phe Met Ser Ser Ser Gly Gly Ser Asp 
225 230 235 240 

tac teg gac cgt cca gtt ctt cct cca ccg tet cca ggg ctt gtg tta 768 
Tyr Ser Asp Arg Pro Val Leu Pro Pro Pro Ser Pro Gly Leu Val Leu 

245 250 255 

ggc ttc tec aaa age act ttc aca tac gag gag eta get aga gee ace 
Gly Phe Ser Lys Ser Thr Phe Thr Tyr Glu Glu Leu Ala Arg Ala Thr 
260 265 270 

aat ggt ttc tec gag gcg aac ttg tta gga caa ggc ggg ttc ggt tac 
Asn Gly Phe Ser Glu Ala Asn Leu Leu Gly Gin Gly Gly Phe Glv Tvr 
275 280 285 

gtg cac aaa ggt gtg ttg cct agt ggg aaa gaa gtt get gtg aag cag 912 
Val Hxs Lys Gly Val Leu Pro Ser Gly Lys Glu Val Ala Val Lys Gin 
290 295 300 

ttg aaa gtt ggg agt ggt cag gga gag agg gag ttt cag gca gag gtt 960 
Leu Lys Val Gly Ser Gly Gin Gly Glu Arg Glu Phe Gin Ala Glu Val 
305 310 315 320 

gag ate ate age aga gtt cac cac agg cat ctg gtg tct ctt gtt ggt 1008 



816 



864 



Glu lie lie Ser Arg Val His His Arg His Leu Val Ser Leu Val Gly 

325 330 335 

tat tgc ate gcc ggt gcc aaa aga ttg ctt gtc tat gag ttt gtt cct 1056 

Tyr Cys lie Ala Gly Ala Lys Arg Leu Leu Val Tyr Glu Phe Val Pro 
340 345 350 

aac aac aat etc gag ctt cac etc cat ggc gag gga egg eet aea atg 1104 

Asn Asn Asn Leu Glu Leu His Leu His Gly Glu Gly Arg Pro Thr Met 
355 360 365 

gaa tgg age aec aga ttg aag att get ctt gga tet get aaa gga ctt 1152 

Glu Trp Ser Thr Arg Leu Lys lie Ala Leu Gly Ser Ala Lys Gly Leu 

370 375 380 



tet tat ctt cat gaa gat tgc aat eet aaa ate att cac cgt gat ate 
Ser Tyr Leu His Glu Asp Cys Asn Pro Lys lie lie His Arg Asp lie 
385 390 395 400 



1200 



aag get tea aac ata ttg ata gat tte aag ttt gaa get aag gtt get 1248 
Lys Ala Ser Asn lie Leu lie Asp Phe Lys Phe Glu Ala Lys Val Ala 

405 410 415 

gat ttt ggt ctt get aag att get tet gat aea aae acg eat gta tea 1296 
Asp Phe Gly Leu Ala Lys lie Ala Ser Asp Thr Asn Thr His Val Ser 
420 425 430 

aea cgt gtg atg gga aec ttt ggg tac ttg get ecg gaa tae get gca 1344 
Thr Arg Val Met Gly Thr Phe Gly Tyr Leu Ala Pro Glu Tyr Ala Ala 
435 440 445 

age gga aag etc acg gag aag tet gae gtt tte tea ttt ggc gtt gtg 13 92 
Ser Gly Lys Leu Thr Glu Lys Ser Asp Val Phe Ser Phe Gly Val Val 
450 455 460 



1440 



1488 



ctt ttg gag etc att act gga cgt cga ecc gtt gat gee aae aat gtc 
Leu Leu Glu Leu lie Thr Gly Arg Arg Pro Val Asp Ala Asn Asn Val 
465 470 475 480 

tat gta gat gae age tta gtt gae tgg gca cga cea ttg ctt aae cga 
Tyr Val Asp Asp Ser Leu Val Asp Trp Ala Arg Pro Leu Leu Asn Arg 

485 490 495 

gca tet gag caa gga gae ttt gag ggt tta get gat gca aag atg aat 1536 
Ala Ser Glu Gin Gly Asp Phe Glu Gly Leu Ala Asp Ala Lys Met Asn 
500 505 510 

aat ggg tat gae aga gag gag atg get cge atg gtt get tgt get geg 15 84 
Asn Gly Tyr Asp Arg Glu Glu Met Ala Arg Met Val Ala Cys Ala Ala 
515 520 525 

get tgt gtt ege cat tea get cge cgc aga cct cge atg age cag att 1632 
Ala Cys Val Arg His Ser Ala Arg Arg Arg Pro Arg Met Ser Gin He 
530 535 540 

gtg cgt gcg tta gaa gga aat gta tea ctg tea gat ett aae gaa ggg 1680 
Val Arg Ala Leu Glu Gly Asn Val Ser Leu Ser Asp Leu Asn Glu Gly 
545 550 555 560 



atg aga cca ggt caa age 
Met Arg Pro Gly Gin Ser 

565 

gat tat gac teg age cag 
Asp Tyr Asp Ser Ser Gin 
580 

atg gca ctt ggaact caa 
Met Ala Leu Gly Thr Gin 
595 

ccg acc agt gac tat gga 
Pro Thr Ser Asp Tyr Gly 
610 

caa acc aca cgc gaa atg 
Gin Thr Thr Arg Glu Met 

625 630 

ggt tat agt gga cct tct 
Gly Tyr Ser Gly Pro Ser 

645 



aat gta tac age tea tac 
Asn Val Tyr Ser Ser Tyr 

570 

tac aat gaa gac atg aag 
Tyr Asn Glu Asp Met Lys 

585 

gag tac aac gee acg ggt 
Glu Tyr Asn Ala Thr Gly 
600 

ctg tac ccg tct ggt tea 
Leu Tyr Pro Ser Gly Ser 
615 620 

gag atg ggg aag att aag 
Glu Met Gly Lys lie Lys 

635 

ctt taa 
Leu 



gga gga age ace 1728 
Gly Gly Ser Thr 
575 

aag ttt agg aaa 1776 
Lys Phe Arg Lys 
590 

gag tac agt aat 1824 

Glu Tyr Ser Asn 

605 

age age gag ggc 1872 
Ser Ser Glu Gly 



aga acc ggt cag 1920 
Arg Thr Gly Gin 
640 

1944 
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Pro 
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Gin 


Gin 
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His 
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Ser 
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Pro 


Ser 


Pro 


Pro 


Arg 


Gin 
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210 

Pro Pro Pro Pro 
225 

Tyr Ser Asp Arg 

Gly Phe Ser Lys 
260 

Asn Gly Phe Ser 
275 

Val His Lys Gly 
290 

Leu Lys Val Gly 
305 

Glu lie lie Ser 

Tyr Cys lie Ala 
340 

Asn Asn Asn Leu 

355 

Glu Trp Ser Thr 
3 70 

Ser Tyr Leu His 

385 

Lys Ala Ser Asn 

Asp Phe Gly Leu 
420 

Thr Arg Val Met 
435 

Ser Gly Lys Leu 
450 

Leu Leu Glu Leu 
465 

Tyr Val Asp Asp 

Ala Ser Glu Gin 
500 

Asn Gly Tyr Asp 
515 

Ala Cys Val Arg 

530 

Val Arg Ala Leu 
545 

Met Arg Pro Gly 

Asp Tyr Asp Ser 
580 

Met Ala Leu Gly 

595 

Pro Thr Ser Asp 
610 

Gin Thr Thr Arg 
625 

Gly Tyr Ser Gly 



215 

Pro Pro Pro Phe 
230 

Pro Val Leu Pro 

245 

Ser Thr Phe Thr 

Glu Ala Ash Leu 

280 

Val Leu Pro Ser 
295 

Ser Gly Gin Gly 

310 

Arg Val His His 
325 

Gly Ala Lys Arg 

Glu Leu His Leu 
360 

Arg Leu Lys lie 

375 

Glu Asp Cys Asn 
390 

lie Leu lie Asp 
405 

Ala Lys lie Ala 

Gly Thr Phe Gly 
440 

Thr Glu Lys Ser 
455 

lie Thr Gly Arg 
470 

Ser Leu Val Asp 
485 

Gly Asp Phe Glu 

Arg Glu Glu Met 
520 

His Ser Ala Arg 

535 

Glu Gly Asn Val 
550 

Gin Ser Asn Val 
565 

Ser Gin Tyr Asn 

Thr Gin Glu Tyr 

600 

Tyr Gly Leu Tyr 
615 

Glu Met Glu Met 
630 

Pro Ser Leu 
645 



220 

Met Ser Ser Ser 

235 

Pro Pro Ser Pro 
250 

Tyr Glu Glu Leu 
265 

Leu Gly Gin Gly 

Gly Lys Glu Val 
300 

Glu Arg Glu Phe 

315 

Arg His Leu Val 
330 

Leu Leu Val Tyr 
345 

His Gly Glu Gly 

Ala Leu Gly Ser 

380 

Pro Lys lie lie 
395 

Phe Lys Phe Glu 
410 

Ser Asp Thr Asn 
425 

Tyr Leu Ala Pro 

Asp Val Phe Ser 
460 

Arg Pro Val Asp 
475 

Trp Ala Arg Pro 
490 

Gly Leu Ala Asp 
505 

Ala Arg Met Val 

Arg Arg Pro Arg 
540 

Ser Leu Ser Asp 
555 

Tyr Ser Ser Tyr 
570 

Glu Asp Met Lys 
585 

Asn Ala Thr Gly 

Pro Ser Gly Ser 
620 

Gly Lys lie Lys 
635 



Gly Gly Ser Asp 

24 0 

Gly Leu Val Leu 
255 

Ala Arg Ala Thr 
270 

Gly Phe Gly Tyr 
285 

Ala Val Lys Gin 

Gin Ala Glu Val 

320 

Ser Leu Val Gly 
335 

Glu Phe Val Pro 
350 

Arg Pro Thr Met 
365 

Ala Lys Gly Leu 

His Arg Asp lie 
400 

Ala Lys Val Ala 
415 

Thr His Val Ser 
430 

Glu Tyr Ala Ala 
445 

Phe Gly Val Val 

Ala Asn Asn Val 
480 

Leu Leu Asn Arg 
495 

Ala Lys Met Asn 
510 

Ala Cys Ala Ala 

525 

Met Ser Gin lie 

Leu Asn Glu Gly 
560 

Gly Gly Ser Thr 
575 

Lys Phe Arg Lys 
590 

Glu Tyr Ser Asn 

605 

Ser Ser Glu Gly 

Arg Thr Gly Gin 
640 
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<211> 2189 
<212> DNA 

<213> Brassica napus 



<220> 

<221> CDS 

<222> (1) . . (2189) 

<223> 



<400> 3 

ttaactctcf ggtctccgtg tctcctctct tctcctgctg cttcctttta acactctctt 60 

catttgcctt tttgatttag atccaaagaa gcagac atg tec teg geg ceg tet 114 

Met Ser Ser Ala Pro Ser 
1 5 

ceg ggg act ggt teg cct cca tet cca cca tea aac tec aca acc acc 162 
Pro Gly Thr Gly Ser Pro Pro Ser Pro Pro Ser Asn Ser Thr Thr Thr 
10 15 20 



act cct cet cca get tec get cct cct cce acc aca cct tet tet cct 
Thr Pro Pro Pro Ala Ser Ala Pro Pro Pro Thr Thr Pro Ser Ser Pro 
25 30 35 



210 



ceg ceg cca tec act att ceg aca tet cct cet cct tet tet cgc tet 258 
Pro Pro Pro Ser Thr lie Pro Thr Ser Pro Pro Pro Ser Ser Arg Ser 
40 45 50 

aca cct tet get cct cct cca tet cca cca act cca tet aeg ceg gga 3 06 

Thr Pro Ser Ala Pro Pro Pro Ser Pro Pro Thr Pro Ser Thr Pro Gly 
55 60 65 70 

tet cca cct cct ett cct eag ceg tet cca cce get cca act aeg cce 354 
Ser Pro Pro Pro Leu Pro Gin Pro Ser Pro Pro Ala Pro Thr Thr Pro 

75 80 85 

gga tet cca cce gca cct gtt act cet cet act ega aac cct cca cct 402 
Gly Ser Pro Pro Ala Pro Val Thr Pro Pro Thr Arg Asn Pro Pro Pro 
90 95 100 

tea gte cca gga cca ceg tec aat cct tea cgc gaa gga gga tet cct 450 
Ser Val Pro Gly Pro Pro Ser Asn Pro Ser Arg Glu Gly Gly Ser Pro 
105 110 115 

cga cct cca tet tet cce teg ceg ceg tet cct tet tec gae ggt tta 498 
Arg Pro Pro Ser Ser Pro Ser Pro Pro Ser Pro Ser Ser Asp Gly Leu 
120 125 130 

tea aca; gga gtg gtg gtg gga ate gee ate gga gga gte get ctg ett 546 
Ser Thr Gly Val Val Val Gly lie Ala lie Gly Gly Val Ala Leu Leu 
135 140 145 150 

gtg ata gtg act ctg att tgt etc etc tgt aag aag aaa cga egg aga 594 
Val lie Val Thr Leu lie Cys Leu Leu Cys Lys Lys Lys Arg Arg Arg 

155 160 165 



gac gaa gaa gat get tac tat gtt cct ccg cca cct cct cct ggt ccc 642 
Asp Glu Glu Asp Ala Tyr Tyr Val Pro Pro Pro Pro Pro Pro Gly Pro 
170 175 180 

aaa gcc gga gga cct tac ggt gga cag cag caa caa tgg egg caa caa 690 
Lys Ala Gly Gly Pro Tyr Gly Gly Gin Gin Gin Gin Trp Arg Gin Gin 
185 190 195 

aac gca aca cca ccg tea gat cat gtc gtg aeg tea eta cca eea cea 738 
Asn Ala Thr Pro Pro Ser Asp His Val Val Thr Ser Leu Pro Pro Pro 
200 205 210 

cct aag get cea tet cea eea egg caa cct cct cca cct cca cca ccg 786 
Pro Lys Ala Pro Ser Pro Pro Arg Gin Pro Pro Pro Pro Pro Pro Pro 
215 220 225 230 

cct ttc atg age age age ggc ggc tec gae tac teg gac cgt cca gtt 834 
Pro Phe Met Ser Ser Ser Gly Gly Ser Asp Tyr Ser Asp Arg Pro Val 

235 240 245 

ctt cct cca ccg tet eea ggg ctt gtg tta ggc ttc tec aaa age act 882 
Leu Pro Pro Pro Ser Pro Gly Leu Val Leu Gly Phe Ser Lys Ser Thr 
250 255 260 

ttc aca tac gag gag eta get aga gee ace aat ggt ttc tec gag gcg 93 0 

Phe Thr Tyr Glu Glu Leu Ala Arg Ala Thr Asn Gly Phe Ser Glu Ala 
265 270 275 

aac ttg tta gga caa ggc ggg ttc ggt tac gtg cac aaa ggt gtg ttg 978 
Asn Leu Leu Gly Gin Gly Gly Phe Gly Tyr Val His Lys Gly Val Leu 
280 285 290 

cct agt ggg aaa gaa gtt get gtg aag cag ttg aaa gtt ggg agt ggt 1026 
Pro Ser Gly Lys Glu Val Ala Val Lys Gin Leu Lys Val Gly Ser Glv 
295 300 305 310 

cag gga gag agg gag ttt cag gca gag gtt gag ate ate age aga gtt 1074 
Gin Gly Glu Arg Glu Phe Gin Ala Glu Val Glu lie He Ser Arg Val 

315 320 325 

cac cac agg cat ctg gtg tet ctt gtt ggt tat tgc ate gcc ggt gee 1122 
His His Arg His Leu Val Ser Leu Val Gly Tyr Cys He Ala Gly Ala 
330 335 340 

aaa aga ttg ctt gtc tat gag ttt gtt cct aac aac aat etc gag ctt 1170 
Lys Arg Leu Leu Val Tyr Glu Phe Val Pro Asn Asn Asn Leu Glu Leu 
345 350 355 

cac etc cat ggc gag gga egg cct aca atg gaa tgg age ace aga ttg 1218 
His Leu His Gly Glu Gly Arg Pro Thr Met Glu Trp Ser Thr Arg Leu 
360 365 370 

aag att get ctt gga tet get aaa gga ctt tet tat ctt cat gaa gat 1266 
Lys He Ala Leu Gly Ser Ala Lys Gly Leu Ser Tyr Leu His Glu Asp 
375 380 385 3 90 



tgc aat cct aaa ate att cac cgt gat ate aag get tea aac ata ttg 



1314 



Cys Asn Pro Lys lie lie His Arg Asp lie Lys Ala Ser Asn He Leu 

395 400 405 

ata gat ttc aag ttt gaa get aag gtt get gat ttt ggt ctt get aag 13 62 

lie Asp Phe Lys Phe Glu Ala Lys Val Ala Asp Phe Gly Leu Ala Lys 

410 415 420 

att get tet gat aea aac acg cat gta tea aea egt gtg atg gga ace 1410 

He Ala Ser Asp Thr Asn Thr His Val Ser Thr Arg Val Met Gly Thr 
425 430 435 

ttt ggg tae ttg get ccg gaa tac get gea age gga aag etc aeg gag 1458 

Phe Gly Tyr Leu Ala Pro Glu Tyr Ala Ala Ser Gly Lys Leu Thr Glu 
440 445 . 450 

aag tet gac gtt ttc tea ttt ggc gtt gtg ctt ttg gag etc att act 1506 

Lys Ser Asp Val Phe Ser Phe Gly Val Val Leu Leu Glu Leu He Thr 
455 460 465 470 

gga egt ega ecc gtt gat gee aac aat gtc tat gta gat gac age tta 1554 

Gly Arg Arg Pro Val Asp Ala Asn Asn Val Tyr Val Asp Asp Ser Leu 

475 480 . 485 



gtt gac tgg gea cga cea ttg ctt aac ega gea tet gag eaa gga gac 
Val Asp Trp Ala Arg Pro Leu Leu Asn Arg Ala Ser Glu Gin Gly Asp 
490 495 500 



1602 



ttt gag ggt tta get gat gea aag atg aat aat ggg tat gac aga gag 1650 
Phe Glu Gly Leu Ala Asp Ala Lys Met Asn Asn Gly Tyr Asp Arg Glu 
505 510 515 

gag atg get ege atg gtt get tgt get geg get tgt gtt ege eat tea 1698 
Glu Met Ala Arg Met Val Ala Cys Ala Ala Ala Cys Val Arg His Ser 
520 525 530 

get ege ege aga ect ege atg age eag att gtg egt geg tta gaa gga 1746 
Ala Arg Arg Arg Pro Arg Met Ser Gin He Val Arg Ala Leu Glu Gly 
535 540 545 550 

aat gta tea ctg tea gat ctt aac gaa ggg atg aga cea ggt eaa age 17 94 

Asn Val Ser Leu Ser Asp Leu Asn Glu Gly Met Arg Pro Gly Gin Ser 

555 560 565 

aat gta tac age tea tac gga gga age ace gat tat gac teg age eag 1842 
Asn Val Tyr Ser Ser Tyr Gly Gly Ser Thr Asp Tyr Asp Ser Ser Gin 
570 575 580 

tac aat gaa gac atg aag aag ttt agg aaa atg gea ctt gga act eaa 18 90 

Tyr Asn Glu Asp Met Lys Lys Phe Arg Lys Met Ala Leu Gly Thr Gin 
585 590 595 

gag tac aac gee acg ggt gag tac agt aat ccg ace agt gac tat gga 193 8 

Glu Tyr Asn Ala Thr Gly Glu Tyr Ser Asn Pro Thr Ser Asp Tyr Gly 
600 605 610 

ctg tae ccg tet ggt tea age age gag ggc eaa ace aea ege gaa atg 1986 
Leu Tyr Pro Ser Gly Ser Ser Ser Glu Gly Gin Thr Thr Arg Glu Met 
615 620 625 630 



gag atg ggg aag att aag aga acc ggt cag ggt tat agt gga cat tct 2 034 

Glu Met Gly Lys He Lys Arg Thr Gly Gin Gly Tyr Ser Gly Pro Ser 

635 640 645 

ctt t aaaccagatg ggagagaaat tgaagggtgt tttttcatta tttttttaaa 2 088 

Leu 



actgtaaaga tatgagaaaa ttgccttact ctaattaaaa ccactacgat ataaggttat 2148 
aatacgtttt gaattggttt ttaaaaaaaa aaaaaaaaaa a 218 9 
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250 








255 




Phe 


Ser 


Lys 


Ser 


Thr 


Phe 


Thr 


Tyr 


Glu 


Glu 


Leu 


Ala 


Arg 


Ala 


Thr 








260 










265 










270 






Asn 


Gly 


Phe 


Ser 


Glu 


Ala 


Asn 


Leu 


Leu 


Gly 


Gin 


Gly 


Gly 


Phe 


Gly 


Tyr 


Val 


His 


275 










280 










285 




Lys 


Gly 


Val 


Leu 


Pro 


Ser 


Gly 


Lys 


Glu 


Val 


Ala 


Val 


Lys 


Gin 




290 










295 










300 









Leu Lys Val Gly 
305 

Glu lie lie Ser 

Tyr Cys lie Ala 
340 

Asn Asn Asn Leu 

355 

Glu Trp Ser Thr 
370 

Ser Tyr Leu His 
385 

Lys Ala Ser Asn 

Asp Phe Gly Leu 
420 

Thr Arg Val Met 

435 

Ser Gly Lys Leu 
450 

Leu Leu Glu Leu 
465 

Tyr Val Asp Asp 

Ala Ser Glu Gin 

500 

Asn Gly Tyr Asp 
515 

Ala Cys Val Arg 
530 

Val Arg Ala Leu 
545 

Met Arg Pro Gly 

Asp Tyr Asp Ser 
580 

Met Ala Leu Gly 
595 

Pro Thr Ser Asp 
610 

Gin Thr Thr Arg 

625 

Gly Tyr Ser Gly 



Ser Gly Gin Gly 
310 

Arg Val His His 
325 

Gly Ala Lys Arg 

Glu Leu His Leu 

360 

Arg Leu Lys lie 

375 ' 
Glu Asp Cys Asn 
390 

lie Leu lie Asp 

405 

Ala Lys lie Ala 

Gly Thr Phe Gly 

440 

Thr Glu Lys Ser 
455 

lie Thr Gly Arg 
470 

Ser Leu Val Asp 
485 

Gly Asp Phe Glu 

Arg Glu Glu Met 
520 

His Ser Ala Arg 
535 

Glu Gly Asn Val 
550 

Gin Ser Asn Val 
565 

Ser Gin Tyr Asn 

Thr Gin Glu Tyr 
600 

Tyr Gly Leu Tyr 
615 

Glu Met Glu Met 
630 

Pro Ser Leu 
645 



Glu Arg Glu Phe 
315 

Arg His Leu Val 

330 

Leu Leu Val Tyr 
345 

His Gly Glu Gly 

Ala Leu Gly Ser 
380 

Pro Lys lie lie 
395 

Phe Lys Phe Glu 
410 

Ser Asp Thr Asn 
425 

Tyr Leu Ala Pro 

Asp Val Phe Ser 
460 

Arg Pro Val Asp 
475 

Trp Ala Arg Pro 
490 

Gly Leu Ala Asp 

505 

Ala Arg Met Val 

Arg Arg Pro Arg 
540 

Ser Leu Ser Asp 
555 

Tyr Ser Ser Tyr 
570 

Glu Asp Met Lys 

585 

Asn Ala Thr Gly 

Pro Ser Gly Ser 
620 

Gly Lys lie Lys 
635 



Gin Ala Glu Val 
320 

Ser Leu Val Gly 
335 

Glu Phe Val Pro 
350 

Arg Pro Thr Met 

365 

Ala Lys Gly Leu 

His Arg Asp lie 
400 

Ala Lys Val Ala 
415 

Thr His Val Ser 

430 

Glu Tyr Ala Ala 
445 

Phe Gly Val Val 

Ala Asn Asn Val 
480 

Leu Leu Asn Arg 
495 

Ala Lys Met Asn 

510 

Ala Cys Ala Ala 
525 

Met Ser Gin lie 

Leu Asn Glu Gly 
560 

Gly Gly Ser Thr 

575 

Lys Phe Arg Lys 
590 

Glu Tyr Ser Asn 
605 

Ser Ser Glu Gly 

Arg Thr Gly Gin 
640 



<210> 5 

<211> 721 

<212> PRT 

<213> Brassica napus 



<400> 5 



Leu Thr Leu Trp Ser Pro Cys Leu Leu Ser Ser Pro Ala Ala Ser Phe 
15 10 15 



His Ser Leu His Leu Pro Phe Phe Arg Ser Lys Glu Ala Asp Met Ser 
20 25 30 

Ser Ala Pro Ser Pro Gly Thr Gly Ser Pro Pro Ser Pro Pro Ser Asn 
35 40 45 

Ser Thr Thr Thr Thr Pro Pro Pro Ala Ser Ala Pro Pro Pro Thr Thr 
50 55 60 

Pro Ser Ser Pro Pro Pro Pro Ser Thr lie Pro Thr Ser Pro Pro Pro 
65 70 75 80 

Ser Ser Arg Ser Thr Pro Ser Ala Pro Pro Pro Ser Pro Pro Thr Pro 

85 90 95 

Ser Thr Pro Gly Ser Pro Pro Pro Leu Pro Gin Pro Ser Pro Pro Ala 
100 105 110 

Pro Thr Thr Pro Gly Ser Pro Pro Ala Pro Val Thr Pro Pro Thr Arg 
115 120 125 

Asn Pro Pro Pro Ser Val Pro Gly Pro Pro Ser Asn Pro Ser Arg Glu 
130 135 140 

Gly Gly Ser Pro Arg Pro Pro Ser Ser Pro Ser Pro Pro Ser Pro Ser 
145 150 155 160 

Ser Asp Gly Leu Ser Thr Gly Val Val Val Gly lie Ala lie Gly Gly 

165 170 175 

Val Ala Leu Leu Val He Val Thr Leu He Cys Leu Leu Cys Lys Lys 
180 185 190 

Lys Arg Arg Arg Asp Glu Glu Asp Ala Tyr Tyr Val Pro Pro Pro Pro 
195 200 205 

Pro Pro Gly Pro Lys Ala Gly Gly Pro Tyr Gly Gly Gin Gin Gin Gin 
210 215 220 

Trp Arg Gin Gin Asn Ala Thr Pro Pro Ser Asp His Val Val Thr Ser 
225 230 235 240 



Leu Pro Pro Pro Pro Lys Ala Pro Ser Pro Pro Arg Gin Pro Pro Pro 

245 250 255 



"lilJliljeb^y :Sil.!^4" 



Pro Pro Pro Pro Pro Phe Met Ser Ser Ser Gly Gly Ser Asp Tyr Ser 
260 265 270 

Asp Arg Pro Val Leu Pro Pro Pro Ser Pro Gly Leu Val Leu Gly Phe 
275 280 285 

Ser Lys Ser Thr Phe Thr Tyr Glu Glu Leu Ala Arg Ala Thr Asn Gly 
290 295 300 

Phe Ser Glu Ala Asn Leu Leu Gly Gin Gly Gly Phe Gly Tyr Val His 
305 310 315 320 

Lys Gly Val Leu Pro Ser Gly Lys Glu Val Ala Val Lys Gin Leu Lys 

325 330 335 

Val Gly Ser Gly Gin Gly Glu Arg Glu Phe Gin Ala Glu Val Glu lie 
340 345 350 

lie Ser Arg Val His His Arg His Leu Val Ser Leu Val Gly Tyr Cys 
355 360 365 

lie Ala Gly Ala Lys Arg Leu Leu Val Tyr Glu Phe Val Pro Asn Asn 
370 375 380 

Asn Leu Glu Leu His Leu His Gly Glu Gly Arg Pro Thr Met Glu Trp 
385 390 395 400 

Ser Thr Arg Leu Lys lie Ala Leu Gly Ser Ala Lys Gly Leu Ser Tyr 

405 410 415 

Leu His Glu Asp Cys Asn Pro Lys lie He His Arg Asp He Lys Ala 
420 425 ^ 430 

Ser Asn He Leu He Asp Phe Lys Phe Glu Ala Lys Val Ala Asp Phe 
435 440 445 

Gly Leu Ala Lys He Ala Ser Asp Thr Asn Thr His Val Ser Thr Arg 
450 455 460 

Val Met Gly Thr Phe Gly Tyr Leu Ala Pro Glu Tyr Ala Ala Ser Gly 
465 470 475 480 



Lys Leu Thr Glu Lys Ser Asp Val Phe Ser Phe Gly Val Val Leu Leu 

485 490 495 



Glu Leu lie Thr Gly Arg Arg Pro Val Asp Ala Asn Asn Val Tyr Val 
500 505 510 

Asp Asp Ser Leu Val Asp Trp Ala Arg Pro Leu Leu Asn Arg Ala Ser 
515 520 525 

Glu Gin Gly Asp Phe Glu Gly Leu Ala Asp Ala Lys Met Asn Asn Gly 
530 535 540 

Tyr Asp Arg Glu Glu Met Ala Arg Met Val Ala Cys Ala Ala Ala Cys 
545 550 555 560 

Val Arg His Ser Ala Arg Arg Arg Pro Arg Met Ser Gin lie Val Arg 

565 570 575 

Ala Leu Glu Gly Asn Val Ser Leu Ser Asp Leu Asn Glu Gly Met Arg 
580 585 590 

Pro Gly Gin Ser Asn Val Tyr Ser Ser Tyr Gly Gly Ser Thr Asp Tyr 
595 600 605 

Asp Ser Ser Gin Tyr Asn Glu Asp Met Lys Lys Phe Arg Lys Met Ala 
610 615 620 

Leu Gly Thr Gin Glu Tyr Asn Ala Thr Gly Glu Tyr Ser Asn Pro Thr 
625 630 635 640 

Ser Asp Tyr Gly Leu Tyr Pro Ser Gly Ser Ser Ser Glu Gly Gin Thr 

645 650 655 

Thr Arg Glu Met Glu Met Gly Lys lie Lys Arg Thr Gly Gin Gly Tyr 
660 665 670 

Ser Gly Pro Ser Leu Thr Arg Trp Glu Arg Asn Arg Val Phe Phe His 
675 680 685 

Tyr Phe Phe Lys Thr Val Lys lie Glu Asn Cys Leu Thr Leu lie Lys 
690 695 700 



Thr Thr Thr lie Gly Tyr Asn Thr Phe lie Gly Phe Lys Lys Lys Lys 
705 710 715 720 



IP '^-3 3 Kin- uia-'iu « ai'- 



Lys 



<210> 6 
<211> 2231 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 6 

tagaaaaaaa aaaatgtcag acttaggcga gtcgccgagt tcttcaccac cagcaccacc 60 
agctgatacc gctcctccac cagagactcc atcagaaaac tcagctcttc cacctgttga 12 0 
ttcctctcct cctagtccac cagctgattc atcatcaaca ccgccgctgt cagaaccatc 180 
cactcctcct ccagattcac agcttcctcc tttaccttcg attcttcctc cgctaacaga 240 
ttctccacct ccaccttccg attcttctcc acccgttgat tcaacccctt ctccgccgcc 3 00 
accgacgtca aacgaatctc cttctcctcc agaagattcc gaaacaccac ctgctccacc 360 
aaatgaatcc aatgacaaca accctcctcc gtctcaagat cttcaatcgc ctcctccatc 420 
gtcgccgtcg ccgaatgtag gacccacaaa cccggaatca ccaccgttac aatctcctcc 480 
agctccacca gcatcagatc ctacaaattc accgccagct tcaccattag accctaccaa 540 
tcctccccca atacaaccat caggaccagc cacttctcct ccggctaatc ccaacgctcc 600 
gccgagccca ttccccacag taccacccaa aactccttct agtggacctg tggtgtctcc 660 
atctctcaca tcccctagta aaggaactcc tactccaaac caaggcaatg gagatggcgg 720 
tggcggtggt ggcggctatc aagggaagac tatggttggt atggctgtag ccggtttcgc 780 
aatcatggcg cttataggcg ttgtgttctt agtgagaaga aagaaaaaga gaaacattga 840 
tagctataat cactcacagt acttgccaca tcccaatttc tctgttaaat cagatggatt 900 
cttatacggt caagatccag gtaaaggata ctcctctggt cctaatggtt caatgtataa 960 
caattcacag caacaacaat cctctatggg aaacagttat ggtacagctg gtggtggtta 102 0 
tcctcatcat caaatgcaat caagtggcac acctgactct gctatactcg gaagtggcca 1080 
gactcatttc agttacgaag agcttgctga gataacacaa ggctttgctc gcaaaaacat 114 0 
tcttggagaa ggcggatttg gatgtgtcta taaaggtaca ttgcaggatg gtaaagttgt 1200 
tgcggttaag cagcttaaag ctggaagtgg acaaggtgac cgtgaattca aagcagaggt 1260 
tgagatcatc agccgcgttc atcatcgcca tttggtctct ctggttggtt actgcatttc 1320 
agaccagcat agattgctta tctatgagta tgtttctaat caaaccttgg agcatcattt 13 8 0 
gcatgagtgg tctaagagag tccggatcgc tataggatca gccaaagggt tggcatatct 144 0 
tcacgaagac tgtcatccga aaatcattca cagagatata aagtcagcaa atattcttct 150 0 
agatgatgaa tatgaagctc aggcaataat gaaatcctcc ttttcgttaa atctatctta 1560 
tgactgtaaa gttttagttg ctgattttgg acttgctaga ctcaatgata caacacaaac 162 0 
tcatgtttca actcgggtta tgggaacctt cgggtaccta gcgccggaat atgcatcaag 168 0 
tggaaaattg actgatagat ccgatgtatt ctcattcggg gttgttctct tagagcttgt 1740 
aactggacgg aaaccagttg accagactca gcctctagga gaagagagtt tggttgaatg 18 0 0 
ggcgcgcccg ctgcttctca aagccattga gaccggagat ttaagcgaac tgattgatac 1860 
acggcttgaa aagcgttatg tggagcatga agtcttcaga atgatcgaga cagccgctgc 1920 
atgtgttaga cattctggtc caaaacgccc acgcatggtt caggttgtga gagcattgga 1980 
ctgcgacgga gactcgggag atattagcaa cggaatcaaa attgggcaaa gcacaactta 2 04 0 
tgactcaggg caatacaatg aagacattat gaaattcagg aaaatggcgt ttggtggtga 2100 
taacagcgta gagtcaggat tgtacagtgg aaactactct gccaaaagct cttcagattt 2160 
ctcagggaat gaatctgaga ctcggccttt caacaaccga cggttctgat catacaatag 222 0 
gtgaaagtaa c 2231 



<210> 7 
<211> 2196 
<212> DNA 

<213> Arabidopsis thaliana 



<220> 
<221> CDS 
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<222> (1) . . (2196) 
<400> 7 

atg tea gac tta ggc gag teg ccg agt tct tea cca eca gca cca eea 48 
Met Ser Asp Leu Gly Glu Ser Pro Ser Ser Ser Pro Pro Ala Pro Pro 
15 10 15 

get gat acc get cct eea eea gag aet eca tea gaa aac tea get ctt 96 
Ala Asp Thr Ala Pro Pro Pro Glu Thr Pro Ser Glu Asn Ser Ala Leu 
20 25 30 

eea eet gtt gat tee tet eet eet agt cca cea get gat tea tea tea 144 
Pro Pro Val Asp Ser Ser Pro Pro Ser Pro Pro Ala Asp Ser Ser Ser 
35 40 45 

aca eeg ccg ctg tea gaa eca tec act cct eet eea gat tea eag ctt 192 
Thr Pro Pro Leu Ser Glu Pro Ser Thr Pro Pro Pro Asp Ser Gin Leu 
50 55 60 

cct cct tta cct teg att ctt cct ccg eta aca gat tct eea cct cea 240 
Pro Pro Leu Pro Ser lie Leu Pro Pro Leu Thr Asp Ser Pro Pro Pro 
65 70 75 80 

cct tec gat tct tct eea cec gtt gat tea acc eet tct ccg ccg eca 288 
Pro Ser Asp Ser Ser Pro Pro Val Asp Ser Thr Pro Ser Pro Pro Pro 

85 90 95 

ccg acg tea aac gaa tct cct tct cct cca gaa gat tec gaa aca cca 336 
Pro Thr Ser Asn Glu Ser Pro Ser Pro Pro Glu Asp Ser Glu Thr Pro 
100 105 110 

eet get cca cca aat gaa tec aat gac aac aac cct cct ccg tet caa 3 84 
Pro Ala Pro Pro Asn Glu Ser Asn Asp Asn Asn Pro Pro Pro Ser Gin 
115 120 125 

gat ctt caa teg cct cct cca teg teg ccg teg ccg aat gta gga cec 432 
Asp Leu Gin Ser Pro Pro Pro Ser Ser Pro Ser Pro Asn Val Gly Pro 
130 135 140 

aca aac eeg gaa tea cca ccg tta caa tct cct cea get cea cea gca 480 
Thr Asn Pro Glu Ser Pro Pro Leu Gin Ser Pro Pro Ala Pro Pro Ala 
145 150 155 160 

tea gat eet aca aat tea ccg eea get tea cca tta gac eet acc aat 528 
Ser Asp Pro Thr Asn Ser Pro Pro Ala Ser Pro Leu Asp Pro Thr Asn 

165 170 175 

cct cec cca ata caa cca tea gga eca gee act tct cct eeg get aat 576 
Pro Pro Pro lie Gin Pro Ser Gly Pro Ala Thr Ser Pro Pro Ala Asn 
180 185 190 

cec aac get ccg eeg age cca tte cec aca gta eea ece aaa act cct 624 
Pro Asn Ala Pro Pro Ser Pro Phe Pro Thr Val Pro Pro Lys Thr Pro 
195 200 205 

tct agt gga cct gtg gtg tct cca tct etc aca tec eet agt aaa gga 672 
Ser Ser Gly Pro Val Val Ser Pro Ser Leu Thr Ser Pro Ser Lys Gly 
210 215 220 



act cct act cca aac caa ggc aat gga gat ggc ggt ggc ggt ggt ggc 720 
Thr Pro Thr Pro Asn Gin Gly Asn Gly Asp Gly Gly Gly Gly Gly Gly 
225 230 235 240 

ggc tat caa ggg aag act atg gtt ggt atg get gta gcc ggt ttc gca 768 
Gly Tyr Gin Gly Lys Thr Met Val Gly Met Ala Val Ala Gly Phe Ala 

245 250 255 

ate atg gcg ctt ata ggc gtt gtg ttc tta gtg aga aga aag aaa aag 816 
lie Met Ala Leu lie Gly Val Val Phe Leu Val Arg Arg Lys Lys Lys 
260 265 270 

aga aac att gat age tat aat cac tea eag tac ttg cca eat cec aat 864 
Arg Asn lie Asp Ser Tyr Asn His Ser Gin Tyr Leu Pro His Pro Asn 
275 280 285 

ttc tet gtt aaa tea gat gga ttc tta tac ggt caa gat cca ggt aaa 912 
Phe Ser Val Lys Ser Asp Gly Phe Leu Tyr Gly Gin Asp Pro Gly Lys 
290 295 300 

gga tac tec tet ggt cct aat ggt tea atg tat aac aat tea eag caa 960 
Gly Tyr Ser Ser Gly Pro Asn Gly Ser Met Tyr Asn Asn Ser Gin Gin 
305 310 315 320 

caa caa tec tet atg gga aac agt tat ggt aca get ggt ggt ggt tat 100 8 
Gin Gin Ser Ser Met Gly Asn Ser Tyr Gly Thr Ala Gly Gly Gly Tyr 

325 330 335 

cct eat cat caa atg caa tea agt ggc aca eet gae tet get ata etc 1056 
Pro His His Gin Met Gin Ser Ser Gly Thr Pro Asp Ser Ala lie Leu 
340 345 350 

gga agt ggc cag act eat ttc agt tac gaa gag ctt get gag ata aca 1104 
Gly Ser Gly Gin Thr His Phe Ser Tyr Glu Glu Leu Ala Glu lie Thr 
355 360 365 

caa ggc ttt get cgc aaa aac att ctt gga gaa ggc gga ttt gga tgt 1152 
Gin Gly Phe Ala Arg Lys Asn He Leu Gly Glu Gly Gly Phe Gly Cys 
370 375 380 

gtc tat aaa ggt aca ttg cag gat ggt aaa gtt gtt gcg gtt aag eag 12 00 
Val Tyr Lys Gly Thr Leu Gin Asp Gly Lys Val Val Ala Val Lys Gin 
385 390 395 400 

ctt aaa get gga agt gga caa ggt gae cgt gaa ttc aaa gca gag gtt 124 8 
Leu Lys Ala Gly Ser Gly Gin Gly Asp Arg Glu Phe Lys Ala Glu Val 

405 410 415 

gag ate ate age cgc gtt cat cat cgc cat ttg gtc tet ctg gtt ggt 1296 
Glu lie lie Ser Arg Val His His Arg His Leu Val Ser Leu Val Gly 
420 425 430 

tac tgc att tea gae cag eat aga ttg ctt ate tat gag tat gtt tet 1344 
Tyr Cys He Ser Asp Gin His Arg Leu Leu He Tyr Glu Tyr Val Ser 
435 440 445 

aat caa ace ttg gag cat cat ttg cat gag tgg tet aag aga gtc egg 13 92 
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Asn 


Gin 
450 


Thr 


Leu 


Glu 


His 


His 
455 


Leu 


His 


Glu 


Trp 


Ser 
460 


Lys 


Arg 


Val 


Arg 


ate 
He 
465 


get 
Ala 


ata 
He 


gga 
Gly 


tea 
Ser 


gee 
Ala 
470 


aaa 
Lys 


ggg 

Gly 


ttg 
Leu 


gea 
Ala 


tat 
Tyr 
475 


ett 
Leu 


cac 
His 


gaa 
Glu 


gae 
Asp 


tgt 

Cys 
480 


cat 
His 


ccg 
Pro 


aaa 
Lys 


ate 
He 


att 
He 
485 


cac 
His 


aga 
Arg 


gat 
Asp 


ata 
He 


aag 
Lys 
490 


tea 
Ser 


gea 
Ala 


aat 
Asn 


att 
He 


ett 
Leu 
495 


eta 
Leu 


gat gat 
Asp Asp 


gaa 
Glu 


tat 
Tyr 
500 


gaa 
Glu 


get 
Ala 


cag 
Gin 


gea 
Ala 


ata 
He 
505 


atg 
Met 


aaa 
Lys 


tec 
Ser 


tec 
Ser 


ttt 
Phe 
510 


teg 
Ser 


tta 
Leu 


aat 

7\ GTI 


eta 

J— ici LL 


tct 
Ser 
515 


tat 
Tyr 


gae 
Asp 


tgt 
Cys 


aaa 
Lys 


gtt 
Val 
520 


tta 
Leu 


gtt 
Val 


get 
Ala 


gat 
Asp 


ttt 
Phe 
525 


gga 
Gly 


ett 
Leu 


get 
Ala 


aga 


etc 

T .£^1 1 
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530 


aat 
Asn 


gat 

Asp 


aea 
Thr 


aea 
Thr 


caa 
Gin 
535 


act 
Thr 


eat 
His 


gtt 
Val 


tea 
Ser 


act 
Thr 
540 


egg 
Arg 


gtt 
Val 


atg 
Met 


gga 
Gly 


acc 

J. 

545 


ttc 

XrXXC 


ggg 

Gly 


tac 
Tyr 


eta 
Leu 


gcg 
Ala 
550 


ccg 
Pro 


gaa 
Glu 


tat 
Tyr 


gea 
Ala 


tea 
Ser 
555 


agt 
Ser 


gga 
Gly 


aaa 
Lys 


ttg 
Leu 


act 
Thr 
560 


gat 


aga 


tee 
Ser 


gat 
Asp 


gta 
Val 
565 


ttc 
Phe 


tea 
Ser 


ttc 
Phe 


ggg 

Gly 


gtt 
Val 
570 


gtt 
Val 


etc 
Leu 


tta 
Leu 


gag 
Glu 


ett 
Leu 
575 


gta 
Val 


act 


gga 


egg 
Arg 


aaa 
Lys 
580 


cca 
Pro 


gtt 
Val 


gae 
Asp 


cag 
Gin 


act 
Thr 
585 


cag 
Gin 


cct 
Pro 


eta 
Leu 


gga 
Gly 


gaa 
Glu 
590 


gag 
Glu 


agt 
Ser 


ttg 
Leu 


gtt 
Val 


gaa 
Glu 
595 


tgg 
Trp 


gcg 
Ala 


cgc 
Arg 


ccg 
Pro 


ctg 
Leu 
600 


ett 
Leu 


etc 
Leu 


aaa 
Lys 


gee 
Ala 


att 
He 
605 


gag 
Glu 


ace 
Thr 


gga 
Gly 


gat 
Asp 


tta 

Leu 
610 


age 
Ser 


gaa 
Glu 


ctg 
Leu 


att 
He 


gat 
Asp 
615 


aea 
Thr 


egg 
Arg 


ett 
Leu 


gaa 
Glu 


aag 
Lys 
620 


cgt 
Arg 


tat 
Tyr 


gtg 

Val 


gag 
Glu 


cat 
His 
625 


gaa 
Glu 


gtc 
Val 


ttc 
Phe 


aga 
Arg 


atg 
Met 
630 


ate 
He 


gag 
Glu 


aca 
Thr 


gcc 
Ala 


get 
Ala 
635 


gea 
Ala 


tgt 
Cys 


gtt 
Val 


aga 
Arg 


cat 
His 
640 


tct 
Ser 


ggt 

Gly 


cca 
Pro 


aaa 
Lys 


cgc 
Arg 
645 


cca 
Pro 


cgc 
Arg 


atg 
Met 


gtt 
Val 


cag 
Gin 
650 


gtt 
Val 


gtg 

Val 


aga 
Arg 


gea 
Ala 


ttg 
Leu 
655 


gae 
Asp 


tgc 
Cys 


gae 
Asp 


gga gae 
Gly Asp 
660 


teg 
Ser 


gga gat 
Gly Asp 


att 
He 


age 
Ser 
665 


aac 
Asn 


gga 
Gly 


ate 
He 


aaa 
Lys 


att 
He 
670 


ggg 

Gly 


caa 
Gin 


age 
Ser 


aea 
Thr 


act 
Thr 


tat 
Tyr 


gae 
Asp 


tea 
Ser 


ggg 

Gly 


caa 
Gin 


tac 
Tyr 


aat 
Asn 


gaa 
Glu 


gae 
Asp 


att 
He 


atg 
Met 


aaa 
Lys 


ttc 
Phe 



1440 



1488 



1536 



1584 



1632 



1680 



1728 



1776 



1824 



1872 



1920 



1968 



2016 



2064 



675 680 685 



agg aaa atg gcg ttt ggt ggt gat aac 

Arg L.ys Met Ala Phe Gly Gly Asp Asn 
690 695 

a-gt gga aac tac tct gcc aaa age tct 

Ser Gly Asn Tyr Ser Ala Lys Ser Ser 
705 710 

tct gag act egg cct ttc aac aac cga 
Ser Glu Thr Arg Pro Phe Asn Asn Arg 

725 



age gta gag tea gga ttg tac 2112 
Ser Val Glu Ser Gly Leu Tyr 
700 

tea gat ttc tea ggg aat gaa 2160 
Ser Asp Phe Ser Gly Asn Glu 
715 720 

egg ttc tga 2196 

Arg Phe 

730 



<210> 8 
<211> 731 
<212> PRT 

<213> Arabidopsis thaliana 



<400> 8 

Met Ser Asp Leu Gly Glu Ser Pro 

1 5 
Ala Asp Thr Ala Pro Pro Pro Glu 
20 



Ser Ser Ser Pro Pro Ala Pro Pro 

10 15 
Thr Pro Ser Glu Asn Ser Ala Leu 

25 30 



Pro Pro Val Asp 

35 

Thr Pro Pro Leu 
50 

Pro Pro Leu Pro 

65 

Pro Ser Asp Ser 

Pro Thr Ser Asn 
100 

Pro Ala Pro Pro 
115 

Asp Leu Gin Ser 
130 

Thr Asn Pro Glu 
145 

Ser Asp Pro Thr 

Pro Pro Pro lie 
180 

Pro Asn Ala Pro 
195 

Ser Ser Gly Pro 

210 

Thr Pro Thr Pro 
225 

Gly Tyr Gin Gly 

lie Met Ala Leu 
260 

Arg Asn lie Asp 
275 

Phe Ser Val Lys 



Ser Ser Pro Pro 
40 

Ser Glu Pro Ser 
55 

Ser lie Leu Pro 

70 

Ser Pro Pro Val 
85 

Glu Ser Pro Ser 

Asn Glu Ser Asn 
120 

Pro Pro Pro Ser 
135 

Ser Pro Pro Leu 

150 

Asn Ser Pro Pro 
165 

Gin Pro Ser Gly 

Pro Ser Pro Phe 
200 

Val Val Ser Pro 

215 

Asn Gin Gly Asn 

230 

Lys Thr Met Val 
245 

lie Gly Val Val 

Ser Tyr Asn His 
280 

Ser Asp Gly Phe 



Ser Pro Pro Ala 

Thr Pro Pro Pro 
60 

Pro Leu Thr Asp 
75 

Asp Ser Thr Pro 

90 

Pro Pro Glu Asp 
105 

Asp Asn Asn Pro 

Ser Pro Ser Pro 
140 

Gin Ser Pro Pro 
155 

Ala Ser Pro Leu 
170 

Pro Ala Thr Ser 
185 

Pro Thr Val Pro 

Ser Leu Thr Ser 
220 

Gly Asp Gly Gly 
235 

Gly Met Ala Val 
250 

Phe Leu Val Arg 
265 

Ser Gin Tyr Leu 
Leu Tyr Gly Gin 



Asp Ser Ser Ser 
45 

Asp Ser Gin Leu 

Ser Pro Pro Pro 
80 

Ser Pro Pro Pro 

95 

Ser Glu Thr Pro 
110 

Pro Pro Ser Gin 
125 

Asn Val Gly Pro 

Ala Pro Pro Ala 

160 

Asp Pro Thr Asn 
175 

Pro Pro Ala Asn 
190 

Pro Lys Thr Pro 
205 

Pro Ser Lys Gly 

Gly Gly Gly Gly 
240 

Ala Gly Phe Ala 
255 

Arg Lys Lys Lys 

270 

Pro His Pro Asn 
285 

Asp Pro Gly Lys 



290 

Gly Tyr Ser Ser 
305 

Gin Gin Ser Ser 

Pro His His Gin 
340 

Gly Ser Gly Gin 
355 

Gin Gly Phe Ala 

370 

Val Tyr Lys Gly 

385 

Leu Lys Ala Gly 

Gin lie lie Ser 
420 

Tyr Cys lie Ser 
435 

Asn Gin Thr Leu 
450 

He Ala He Gly 

465 

His Pro Lys He 

Asp Asp Glu Tyr 
500 

Asn Leu Ser Tyr 
515 

Arg Leu Asn Asp 

530 

Thr Phe Gly Tyr 
545 

Asp Arg Ser Asp 

Thr Gly Arg Lys 
580 

Leu Val Glu Trp 
595 

Asp Leu Ser Glu 
610 

His Glu Val Phe 
625 

Ser Gly Pro Lys 

Cys Asp Gly Asp 
660 

Ser Thr Thr Tyr 

675 

Arg Lys Met Ala 
690 

Ser Gly Asn Tyr 
705 

Ser Glu Thr Arg 



295 

Gly Pro Asn Gly 
310 

Met Gly Asn Ser 

325 

Met Gin Ser Ser 

Thr His Phe Ser 
360 

Arg Lys Asn He 
375 

Thr Leu Gin Asp 

390 

Ser Gly Gin Gly 
405 

Arg Val His His 

Asp Gin His Arg 
440 

Glu His His Leu 
455 

Ser Ala Lys Gly 

470 

He His Arg Asp 
485 

Glu Ala Gin Ala 

Asp Cys Lys Val 
520 

Thr Thr Gin Thr 

535 

Leu Ala Pro Glu 
550 

Val Phe Ser Phe 
565 

Pro Val Asp Gin 

Ala Arg Pro Leu 
600 

Leu He Asp Thr 

615 

Arg Met He Glu 
630 

Arg Pro Arg Met 
645 

Ser Gly Asp He 

Asp Ser Gly Gin 
680 

Phe Gly Gly Asp 
695 

Ser Ala Lys Ser 
710 

Pro Phe Asn Asn 
725 



300 

Ser Met Tyr Asn 
315 

Tyr Gly Thr Ala 

330 

Gly Thr Pro Asp 
345 

Tyr Glu Glu Leu 

Leu Gly Glu Gly 
380 

Gly Lys Val Val 
395 

Asp Arg Glu Phe 
410 

Arg His Leu Val 
425 

Leu Leu He Tyr 

His Glu Trp Ser 
460 

Leu Ala Tyr Leu 

475 

He Lys Ser Ala 
490 

He Met Lys Ser 
505 

Leu Val Ala Asp 

His Val Ser Thr 
540 

Tyr Ala Ser Ser 

555 

Gly Val Val Leu 
570 

Thr Gin Pro Leu 

585 

Leu Leu Lys Ala 

Arg Leu Glu Lys 

620 

Thr Ala Ala Ala 
635 

Val Gin Val Val 
650 

Ser Asn Gly He 
665 

Tyr Asn Glu Asp 

Asn Ser Val Glu 

700 

Ser Ser Asp Phe 
715 

Arg Arg Phe 
730 



Asn Ser Gin Gin 
320 

Gly Gly Gly Tyr 

335 

Ser Ala He Leu 
350 

Ala Glu He Thr 
365 

Gly Phe Gly Cys 

Ala Val Lys Gin 
400 

Lys Ala Glu Val 

415 

Ser Leu Val Gly 
430 

Glu Tyr Val Ser 

445 

Lys Arg Val Arg 

His Glu Asp Cys 

480 

Asn He Leu Leu 
495 

Ser Phe Ser Leu 
510 

Phe Gly Leu Ala 
525 

Arg Val Met Gly 

Gly Lys Leu Thr 

560 

Leu Glu Leu Val 
575 

Gly Glu Glu Ser 
590 

He Glu Thr Gly 
605 

Arg Tyr Val Glu 

Cys Val Arg His 
640 

Arg Ala Leu Asp 
655 

Lys He Gly Gin 
670 

He Met Lys Phe 
685 

Ser Gly Leu Tyr 

Ser Gly Asn Glu 
720 



<210> 9 
<211> 1939 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 9 

gaaaattttg atctccgatg gcttcttctc 
cttctccatc tccaccgtct aataccaatt 
ctccttctcc tacacctcct caaggagact 
ctccaccagc tccacaagct cctaaccctc 
cgtcacaggg cggtggagga gaaagaggaa 
caccgtcacg cggctctcct ccttctcctc 
gcagatcatc gccaccagga gacactggag 
gaggaagcag tggaggaggt ggaggtggaa 
gtgtattagt cggagctgga cttttgatga 
aaaagaagag aaaagactcc ttctaccctg 
atggaaacaa caacaacaac aatgcttcac 
aaggccaaaa ccaacaatct actggtggtt 
ctccgcggat gcctacaagc ggagaagatt 
ttttacctcc tccttcgcct gctctagccc 
aagagcttgc ggctgcaaca ggagggttta 
ttgggtatgt ccataaagga gtcttgccta 
aagcgggtag cggacaagga gagagggagt 
tgcatcatcg gtatcttgtt tctttggttg 
tggtttatga gtttgttcct aacaaaactt 
cggtaatgga gttctccact aggttgcgta 
accttcacga agactgccat cctcggatca 
tcttggactt caactttgat gctatggtgg 
ataacaacac tcatgtatct actcgtgtga 
atgcttcaag cggtaaatta accgagaaat 
tggaacttat aactggaaaa cgaccggttg 
tagattgggc tcggcctctt atggctcgcg 
cagatgcgag gcttgaaggc aactacaacc 
ccgctgctag cattcgtcat tcggggcgta 
cgttagaagg agaagtgtcc ttagatgctt 
acgtttacgg gtcattggga gcaagctcgg 
tgaagaaatt cagacagata gctttgtcga 
gaacatctag taatgattct agagatatgg 
atcgaatcaa tgattctgt 



ctgaatctgc tcctccaaca aactccacct 60 
caaccacctc ttctccgccg gctccgtctc 120 
catcatcatc gccacctcct gattccacat 180 
ctaattcctc taataactct ccttcccctc 240 
atggaggaaa caatggtggc aatgatactc 3 00 
cttctaggag taatggagat aatggtggta 3 60 
gctctcgctc agacaaccct ccttctagcg 420 
gaagtaatac gaatacagcg atcatagttg 48 0 
tcgttcttat tattgtgtgt cttagacgca 540 
aacccatgaa aggaaaccaa tatcaatact 60 0 
agaattatcc gaattggcac ctaaattcac 660 
9SS9^99^99 tggaccatca ccgcctcctc 72 0 
cttccatgta ctcaggccca tcacgcccag 780 
tcggattcaa caagagcact tttacttacc 840 
cggatgctaa ccttttggga cagggaggat 900 
gcgggaaaga agtagcagtt aagagtttaa 960 
ttcaagctga ggtcgatatc attagccgtg 1020 
gatattgcat agctgatgga cagaggatgt 1080 
tggaatatca tcttcatggg aaaaatcttc 1140 
tcgccttagg tgctgcgaaa ggactcgctt 12 0 0 
ttcaccgcga catcaagtct gcaaatattc 1260 
ctgattttgg attagctaag ttaacatctg 1320 
tgggaacttt cggatatcta gctccagaat 13 8 0 
ccgatgtttt ctcttacgga gttatgttat 1440 
ataatagcat caccatggac gacaccttag 15 0 0 
cgctagaaga tggaaacttt aatgagctcg 1560 
cgcaagaaat ggctcgaatg gtgacttgtg 162 0 
aacgtccaaa gatgagccag atagtaagag 168 0 
taaacgaagg tgtgaagcca ggacacagta 174 0 
attatagtca gacatcttac aatgcagaca 1800 
gccaagaatt cccagtcagt gactgtgaag 1860 
gaactaagag ccctactcct ccaaaatgag 192 0 

1939 



<210> 10 
<211> 1902 
<212> DNA 

<213> Arabidopsis thaliana 

<220> 

<221> CDS 

<222> (1) . . (1902) 

<400> 10 

atg get tct tct cct gaa tct get cet cca aea aae tec ace tct tct 48 

Met Ala Ser Ser Pro Glu Ser Ala Pro Pro Thr Asn Ser Thr Ser Ser 
15 10 15 



cca tct cea ccg tct aat acc aat tea ace acc tct tct ccg ccg get 96 
Pro Ser Pro Pro Ser Asn Thr Asn Ser Thr Thr Ser Ser Pro Pro Ala 
20 25 30 



J.'fi J JIJ J qi" ... JIJ aC4li£:3i ri 



ccg tct act cct tct cct aca cct cct caa gga gac tea tea tea teg 
Pro Ser Pro Pro Ser Pro Thr Pro Pro Gin Gly Asp Ser Ser Ser Ser 
35 40 45 



144 



cca cct cct gat tec aca tct eca cca get eca caa get cct aac cct 192 
Pro Pro Pro Asp Ser Thr Ser Pro Pro Ala Pro Gin Ala Pro Asn Pro 
50 55 60 

cct aat tec tct aat aac tct cct tec cct ccg tea cag ggc ggt gga 240 
Pro Asn Ser Ser Asn Asn Ser Pro Ser Pro Pro Ser Gin Gly Gly Gly 
65 70 75 80 



gga gaa aga gga aat gga gga aac aat ggt ggc aat gat act cca ccg 
Gly Glu Arg Gly Asn Gly Gly Asn Asn Gly Gly Asn Asp Thr Pro Pro 

85 90 95 

tea cgc ggc tct cct cct tct cct cct tct agg agt aat gga gat aat 
Ser Arg Gly Ser Pro Pro Ser Pro Pro Ser Arg Ser Asn Gly Asp Asn 
100 105 110 

ggt ggt age aga tea teg cca cca gga gac act gga ggc tct cgc tea 
Gly Gly Ser Arg Ser Ser Pro Pro Gly Asp Thr Gly Gly Ser Arg Ser 
115 120 125 



gga ctt ttg atg ate gtt ctt att att gtg tgt ctt aga cgc aaa aag 
Gly Leu Leu Met lie Val Leu lie lie Val Cys Leu Arg Arg Lys Lys 

165 170 175 



tgg gga ggc ggt gga cca tea ccg cct cct cct ccg egg atg cct aca 
Trp Gly Gly Gly Gly Pro Ser Pro Pro Pro Pro Pro Arg Met Pro Thr 
225 230 235 240 



288 



336 



384 



gac aac cct cct tct age gga gga age agt gga gga ggt gga ggt gga 432 
Asp Asn Pro Pro Ser Ser Gly Gly Ser Ser Gly Gly Gly Gly Gly Gly 
130 135 140 

aga agt aat aeg aat aca gcg ate ata gtt ggt gta tta gtc gga get 4 80 

Arg Ser Asn Thr Asn Thr Ala lie lie Val Gly Val Leu Val Gly Ala 
145 150 155 160 



528 



aag aga aaa gac tec ttc tac cct gaa cec atg aaa gga aac caa tat 576 
Lys Arg Lys Asp Ser Phe Tyr Pro Glu Pro Met Lys Gly Asn Gin Tyr 
180 185 190 

caa tac tat gga aac aac aac aac aac aat get tea cag aat tat ccg 624 
Gin Tyr Tyr Gly Asn Asn Asn Asn Asn Asn Ala Ser Gin Asn Tyr Pro 
195 200 205 

aat tgg cac eta aat tea caa ggc caa aac caa caa tct act ggt ggt 672 
Asn Trp His Leu Asn Ser Gin Gly Gin Asn Gin Gin Ser Thr Gly Gly 
210 215 220 



720 



age gga gaa gat tct tec atg tac tea ggc cca tea cgc cca gtt tta 768 
Ser Gly Glu Asp Ser Ser Met Tyr Ser Gly Pro Ser Arg Pro Val Leu 

245 250 255 
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cct cct cct teg cct get eta gee etc gga ttc aac aag age aet ttt 
Pro Pro Pro Ser Pro Ala Leu Ala Leu Gly Phe Asn Lys Ser Thr Phe 
260 265 270 



816 



aet tae eaa gag ctt gcg get gea aea gga ggg ttt acg gat get aae 864 
Thr Tyr Gin Glu Leu Ala Ala Ala Thr Gly Gly Phe Thr Asp Ala Asn 
275 280 285 

ett ttg gga eag gga gga ttt ggg tat gtc cat aaa gga gte ttg cct 912 
Leu Leu Gly Gin Gly Gly Phe Gly Tyr Val His Lys Gly Val Leu Pro 
290 295 300 

age ggg aaa gaa gta gea gtt aag agt tta aaa gcg ggt age gga eaa 960 
Ser Gly Lys Glu Val Ala Val Lys Ser Leu Lys Ala Gly Ser Gly Gin 
305 310 315 320 



gga gag agg gag ttt eaa get gag gtc gat ate att age cgt gtg cat 
Gly Glu Arg Glu Phe Gin Ala Glu Val Asp lie lie Ser Arg Val His 

325 330 335 



ate gee tta ggt get gcg aaa gga etc get tae ett cac gaa gac tgc 
lie Ala Leu Gly Ala Ala Lys Gly Leu Ala Tyr Leu His Glu Asp Cys 
385 , 390 395 400 

cat cct egg ate att cac cgc gac ate aag tet gea aat att etc ttg 
His Pro Arg lie He His Arg Asp He Lys Ser Ala Asn He Leu Leu 

405 410 415 



aaa cga ccg gtt gat aat age ate acc atg gac gac ace tta gta gat 
Lys Arg Pro Val Asp Asn Ser lie Thr Met Asp Asp Thr Leu Val Asp 



1008 



cat egg tat ctt gtt tet ttg gtt gga tat tgc ata get gat gga eag 1056 
His Arg Tyr Leu Val Ser Leu Val Gly Tyr Cys lie Ala Asp Gly Gin 
340 345 350 

agg atg ttg gtt tat gag ttt gtt cct aac aaa act ttg gaa tat eat 1104 
Arg Met Leu Val Tyr Glu Phe Val Pro Asn Lys Thr Leu Glu Tyr His 
355 360 365 

ctt cat ggg aaa aat ett ccg gta atg gag ttc tec act agg ttg cgt 1152 
Leu His Gly Lys Asn Leu Pro Val Met Glu Phe Ser Thr Arg Leu Arg 
370 375 380 



1200 



1248 



gac ttc aac ttt gat get atg gtg get gat ttt gga tta get aag tta 1296 
Asp Phe Asn Phe Asp Ala Met Val Ala Asp Phe Gly Leu Ala Lys Leu 
420 425 430 

aea tet gat aac aae act cat gta tet act cgt gtg atg gga act ttc 1344 
Thr Ser Asp Asn Asn Thr His Val Ser Thr Arg Val Met Gly Thr Phe 
435 440 445 

gga tat eta get eca gaa tat get tea age ggt aaa tta acc gag aaa 13 92 
Gly Tyr Leu Ala Pro Glu Tyr Ala Ser Ser Gly Lys Leu Thr Glu Lys 
450 455 460 

tee gat gtt ttc tet tae gga gtt atg tta ttg gaa ctt ata act gga 1440 
Ser Asp Val Phe Ser Tyr Gly Val Met Leu Leu Glu Leu He Thr Gly 

465 470 475 480 



1488 
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485 490 495 

tgg get egg cct ctt atg get cgc gcg eta gaa gat gga aae ttt aat 153 6 
Trp Ala Arg Pro Leu Met Ala Arg Ala Leu Glu Asp Gly Asn Phe Asn 
500 505 510 

gag etc gea gat geg agg ctt gaa gge aae tae aae eeg eaa gaa atg 1584 
Glu Leu Ala Asp Ala Arg Leu Glu Gly Asn Tyr Asn Pro Gin Glu Met 
515 520 525 

get ega atg gtg act tgt gee get get age att egt eat teg ggg egt 1632 
Ala Arg Met Val Thr Cys Ala Ala Ala Ser lie Arg His Ser Gly Arg 
530 535 540 

aaa egt eea aag atg age cag ata gta aga geg tta gaa gga gaa gtg 1680 
Lys Arg Pro Lys Met Ser Gin lie Val Arg Ala Leu Glu Gly Glu Val 
545 550 555 560 

tee tta gat get tta aae gaa ggt gtg aag eea gga cae agt aae gtt 1728 
Ser Leu Asp Ala Leu Asn Glu Gly Val Lys Pro Gly His Ser Asn Val 

565 570 575 

tae ggg tea ttg gga gea age teg gat tat agt eag aca tet tae aat 1776 
Tyr Gly Ser Leu Gly Ala Ser Ser Asp Tyr Ser Gin Thr Ser Tyr Asn 
580 585 590 



gea gae atg aag aaa tte aga eag ata get ttg teg age caa gaa tte 
Ala Asp Met Lys Lys Phe Arg Gin lie Ala Leu Ser Ser Gin Glu Phe 
595 600 605 



gga act aag age cct act cct cca aaa tga 
Gly Thr Lys Ser Pro Thr Pro Pro Lys 
625 630 



<210> 11 
<211> 633 
<212> PRT 

<213> Arabidopsis thaliana 



<400> 11 

Met Ala Ser Ser Pro Glu Ser Ala Pro Pro Thr Asn Ser Thr Ser Ser 

15 10 15 

Pro Ser Pro Pro Ser Asn Thr Asn Ser Thr Thr Ser Ser Pro Pro Ala 

20 25 30 

Pro Ser Pro Pro Ser Pro Thr Pro Pro Gin Gly Asp Ser Ser Ser Ser 

35 40 45 

Pro Pro Pro Asp Ser Thr Ser Pro Pro Ala Pro Gin Ala Pro Asn Pro 

50 55 60 

Pro Asn Ser Ser Asn Asn Ser Pro Ser Pro Pro Ser Gin Gly Gly Gly 
65 70 75 80 

Gly Glu Arg Gly Asn Gly Gly Asn Asn Gly Gly Asn Asp Thr Pro Pro 

85 90 95 

Ser Arg Gly Ser Pro Pro Ser Pro Pro Ser Arg Ser Asn Gly Asp Asn 



1824 



eea gtc agt gae tgt gaa gga aea tet agt aat gat tct aga gat atg 18 72 
Pro Val Ser Asp Cys Glu Gly Thr Ser Ser Asn Asp Ser Arg Asp Met 
610 615 620 



1902 



100 

Gly Gly Ser Arg 
115 

Asp Asn Pro Pro 

130 

Arg Ser Asn Thr 
145 

Gly Leu Leu Met 

Lys Arg Lys Asp 
180 

Gin Tyr Tyr Gly 

195 

Asn Trp His Leu 
210 

Trp Gly Gly Gly 
225 

Ser Gly Glu Asp 

Pro Pro Pro Ser 
260 

Thr Tyr Gin Glu 

275 

Leu Leu Gly Gin 
290 

Ser Gly Lys Glu 
305 

Gly Glu Arg Glu 

His Arg Tyr Leu 

340 

Arg Met Leu Val 
355 

Leu His Gly Lys 
370 

lie Ala Leu Gly 

385 

His Pro Arg lie 

Asp Phe Asn Phe 

420 

Thr Ser Asp Asn 
435 

Gly Tyr Leu Ala 
450 

Ser Asp Val Phe 
465 

Lys Arg Pro Val 

Trp Ala Arg Pro 
500 

Glu Leu Ala Asp 
515 

Ala Arg Met Val 
530 

Lys Arg Pro Lys 
545 

Ser Leu Asp Ala 



Ser Ser Pro Pro 
120 

Ser Ser Gly Gly 

135 

Asn Thr Ala lie 
150 

lie Val Leu lie 
165 

Ser Phe Tyr Pro 

Asn Asn Asn Asn 
200 

Asn Ser Gin Gly 
215 

Gly Pro Ser Pro 
230 

Ser Ser Met Tyr 
245 

Pro Ala Leu Ala 

Leu Ala Ala Ala 

280 

Gly Gly Phe Gly 
295 

Val Ala Val Lys 
310 

Phe Gin Ala Glu 

325 

Val Ser Leu Val 

Tyr Glu Phe Val 
360 

Asn Leu Pro Val 
375 

Ala Ala Lys Gly 
390 

lie His Arg Asp 
405 

Asp Ala Met Val 

Asn Thr His Val 
440 

Pro Glu Tyr Ala 
455 

Ser Tyr Gly Val 
470 

Asp Asn Ser He 
485 

Leu Met Ala Arg 

Ala Arg Leu Glu 
520 

Thr Cys Ala Ala 
535 

Met Ser Gin lie 
550 

Leu Asn Glu Gly 



105 

Gly Asp Thr Gly 

Ser Ser Gly Gly 
140 

lie Val Gly Val 
155 

He Val Cys Leu 
170 

Glu Pro Met Lys 
185 

Asn Asn Ala Ser 

Gin Asn Gin Gin 

220 

Pro Pro Pro Pro 
235 

Ser Gly Pro Ser 
250 

Leu Gly Phe Asn 
265 

Thr Gly Gly Phe 

Tyr Val His Lys 
300 

Ser Leu Lys Ala 
315 

Val Asp He He 
330 

Gly Tyr Cys He 
345 

Pro Asn Lys Thr 

Met Glu Phe Ser 
380 

Leu Ala Tyr Leu 
395 

He Lys Ser Ala 
410 

Ala Asp Phe Gly 

425 

Ser Thr Arg Val 

Ser Ser Gly Lys 
460 

Met Leu Leu Glu 
475 

Thr Met Asp Asp 
490 

Ala Leu Glu Asp 

505 

Gly Asn Tyr Asn 

Ala Ser He Arg 
540 

Val Arg Ala Leu 
555 

Val Lys Pro Gly 



110 

Gly Ser Arg Ser 
125 

Gly Gly Gly Gly 

Leu Val Gly Ala 
160 

Arg Arg Lys Lys 
175 

Gly Asn Gin Tyr 
190 

Gin Asn Tyr Pro 
205 

Ser Thr Gly Gly 

Arg Met . Pro Thr 
240 

Arg Pro Val Leu 

255 

Lys Ser Thr Phe 
270 

Thr Asp Ala Asn 

285 

Gly Val Leu Pro 

Gly Ser Gly Gin 
320 

Ser Arg Val His 
335 

Ala Asp Gly Gin 
350 

Leu Glu Tyr His 

365 

Thr Arg Leu Arg 

His Glu Asp Cys 
400 

Asn He Leu Leu 
415 

Leu Ala Lys Leu 

430 

Met Gly Thr Phe 
445 

Leu Thr Glu Lys 

Leu He Thr Gly 
480 

Thr Leu Val Asp 
495 

Gly Asn Phe Asn 
510 

Pro Gin Glu Met 
525 

His Ser Gly Arg 

Glu Gly Glu Val 
560 

His Ser Asn Val 



565 570 575 

Tyr Gly Ser Leu Gly Ala Ser Ser Asp Tyr Ser Gin Thr Ser Tyr Asn 

580 585 590 

Ala Asp Met Lys Lys Phe Arg Gin lie Ala Leu Ser Ser Gin Glu Phe 

595 600 605 

Pro Val Ser Asp Cys Glu Gly Thr Ser Ser Asn Asp Ser Arg Asp Met 

610 615 620 

Gly Thr Lys Ser Pro Thr Pro Pro Lys 
625 630 



<210> 12 
<211> 2104 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 12 

tccaccgttt gagaaaccct aataacaaca 
tctcctgccc ctgaaacctc aaatgggaca 
gagtcatcgc cgccaacacc accttcttca 
ccagatatct ccgcttcttt ttcaccgccg 
cctacatctc cgtcctcatc gccgcctgtt 
aatccttctc cacctgcacc tgaaggctca 
acaccgtcga accaatcacc ggaaagacca 
cgaaacagaa ccaatggcgg aaacaacaac 
tcagggaaca gaacttccgg tgacggtggc 
cctcagaata gtggagattc agactcatca 
atttgcatct gttgcaacag gaagaagaag 
tactacaata acaatcctta tggaggagca 
acacctcaag atcatgtggt gaatatggct 
caacctgtgt ctggtcctca cagtgatgct 
tcacctcaag ctgcaactct tggtcacaac 
attgcaacag aaggtttcgc tcagtcaaat 
cataaaggag ttctgcctag tggcaaagaa 
ggacaagggg aacgcgagtt tcaagcagag 
catctcgttt ctcttgttgg atattgcatc 
tttataccta acaacactct tgaatttcat 
tggcctacaa gagtgaagat tgcattggga 
gactgtaaga aaatctttat ctcacatatt 
gatatcaaag ctgcaaacat tcttcttgat 
ggattggcta agctatctca agacaactat 
tttggatact tagctccaga gtatgcatca 
ttctcatttg gagtaatgct tcttgagctc 
ggagaaatgg aagatagctt ggtagattgg 
gatggagatt acaaccaatt ggctgatcca 
atggttcaaa tggcttcttg tgcagctgca 
aagatgagcc aggttcaaaa actcatacca 
gaaggagata tgtcaatgga tgatctaagt 
ttgagccccg ggagcgtgag ctcagagtat 
aagttcaaga aactggcgtt agagaataaa 
agtgagtatg gcttaaaccc ttctgcttca 
aaacgcaatc ctcagctttg aaagaagaga 
ctct 



ttcaaaatgg cggactcacc ggtggattca 6 0 
ccaccgtcaa acggaacatc gccgtctaat 12 0 
ccaccaccat catcaatatc tgctcctccg 180 
cctgcaccac caacgcaaga aacgtcacct 240 
gtagctaatc cgtcaccgca gactccagag 3 00 
actcctgtaa cgccacctgc accaccacaa 360 
actcctcctt ctcctggtgc caatgatgac 42 0 
agagacggct ccacaccatc accaccgtcg 480 
tcaccttcac cacctcggtc gataagccct 540 
tcggggcttt tgcttctact tgcagtgtgt 600 
aagaaatctc ctcaggtcaa ccacatgcac 660 
ccctcaggta atggtggtta ttacaaggga 72 0 
ggtcaaggag gtgggaattg gggtccacag 7 80 
tccaacttaa ccggtcgaac tgctataccg 840 
caaagcactt tcacatacga tgaactgtcc 900 
ttgctaggac aaggaggatt tgggtatgtt 96 0 
gttgcagtga agagtcttaa acttggaagt 1020 
gttgatatca ttagccgtgt ccatcatcgt 1080 
tctggtggtc aaagactttt ggtttatgag 1140 
cttcatggaa agggtcgtcc ggttttggat 12 0 0 
tcagctagag gccttgcata tttgcatgaa 1260 
tgcatcagtc accctcgcat tatccacaga 1320 
ttcagttttg agaccaaggt ggcagatttt 13 8 0 
actcatgtct ccactcgcgt catgggaact 1440 
agcggaaagt tatccgacaa atctgatgtt 15 00 
ataaccggaa gacctcctct ggatctaact 1560 
gcaaggcctt tgtgtttgaa agcagctcaa 1620 
cgtctagagc taaactacag tcatcaagag 1680 
gcaatcagac attcagcaag aagacggcct 174 0 
cttgttggtt ctattattgt acgagcacta 18 0 0 
gagggaacaa gaccaggaca aagcacgtac 1860 
gacgcaagct cgtacacggc agacatgaaa 192 0 
gaatatcaaa gcagtgaata tggtggaaca 198 0 
agtagtgaag aaatgaatag aggctcaatg 2 04 0 
caacacttgt cataatattt cagttttctt 2100 

2104 



<210> 13 
<211> 2025 



<212> DNA 

<213> Arabidopsis thaliana 

<220> 

<221> CDS 

<222> (1) . . (2025) 

<400> 13 

a.tg gcg gac tea ccg gtg gat tea tct cct gcc cct gaa acc tea aat 48 

Met Ala Asp Ser Pro Val Asp Ser Ser Pro Ala Pro Glu Thr Ser Asn 
15 10 15 

ggg aea eea ecg tea aae gga aea teg eeg tet aat gag tea teg eeg 96 
Gly Thr Pro Pro Ser Asn Gly Thr Ser Pro Ser Asn Glu Ser Ser Pro 
20 25 30 

eea aea eea eet tet tea eea cea eea tea tea ata tet get eet eeg 144 
Pro Thr Pro Pro Ser Ser Pro Pro Pro Ser Ser lie Ser Ala Pro Pro 

35 40 45 

eea gat ate tee get tct ttt tea eeg eeg eet gca eea cea aeg eaa 192 
Pro Asp lie Ser Ala Ser Phe Ser Pro Pro Pro Ala Pro Pro Thr Gin 
50 55 60 

gaa acg tea cct cct aca tct ccg tec tea teg ccg cct gtt gta get 240 
Glu Thr Ser Pro Pro Thr Ser Pro Ser Ser Ser Pro Pro Val Val Ala 
65 70 75 80 

aat eeg tea eeg eag act eea gag aat eet tet cea cct gea cct gaa 288 
Asn Pro Ser Pro Gin Thr Pro Glu Asn Pro Ser Pro Pro Ala Pro Glu 

85 90 , 95 

ggc tea act cct gta acg cea cct gca cea cea caa aca ccg teg aae 336 
Gly Ser Thr Pro Val Thr Pro Pro Ala Pro Pro Gin Thr Pro Ser Asn 
100 105 110 

caa tea ccg gaa aga cea act cct cct tet cct ggt gcc aat gat gac 3 84 
Gin Ser Pro Glu Arg Pro Thr Pro Pro Ser Pro Gly Ala Asn Asp Asp 
115 120 125 

ega aae aga ace aat ggc gga aac aac aae aga gac ggc tee aea eea 432 
Arg Asn Arg Thr Asn Gly Gly Asn Asn Asn Arg Asp Gly Ser Thr Pro 
130 135 140 

tea cea eeg teg tea ggg aae aga act tee ggt gac ggt gge tea eet 480 
Ser Pro Pro Ser Ser Gly Asn Arg Thr Ser Gly Asp Gly Gly Ser Pro 
145 150 155 160 

tea cea eet egg teg ata age eet eet eag aat agt gga gat tea gae 528 
Ser Pro Pro Arg Ser lie Ser Pro Pro Gin Asn Ser Gly Asp Ser Asp 

165 170 175 

tea tea teg ggg ett ttg ett eta ett gea gtg tgt att tge ate tgt 576 
Ser Ser Ser Gly Leu Leu Leu Leu Leu Ala Val Cys lie Cys lie Cys 

180 185 190 



tge aae agg aag aag aag aag aaa tet eet eag gtc aae eae atg eac 
Cys Asn Arg Lys Lys Lys Lys Lys Ser Pro Gin Val Asn His Met His 



624 
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195 

tac tac aat 
Tyr Tyr Asn 
210 

tat tac aag 
Tyr Tyr Lys 
225 

gga ggt ggg 
Gly Gly Gly 



gat get tec 
Asp Ala Ser 



gca act ctt 
Ala Thr Leu 
275 

att gca aca 
lie Ala Thr 
290 

ttt ggg tat 
Phe Gly Tyr 
305 

gtg aag agt 
Val Lys Ser 



gca gag gtt 
Ala Glu Val 



ctt gtt gga 
Leu Val Gly 
355 

ttt ata cct 
Phe lie Pro 
370 

ccg gtt ttg 
Pro Val Leu 
385 

aga ggc .ctt 
Arg Gly Leu 



cat att tgc 
His lie Cys 



aac aat cct 
Asn Asn Pro 



gga aca cct 
Gly Thr Pro 
230 

aat tgg ggt 
Asn Trp Gly 
245 

aac tta acc 
Asn Leu Thr 
260 

ggt cac aac 
Gly His Asn 



gaa ggt ttc 
Glu Gly Phe 



gtt cat aaa 
Val His Lys 
310 

ctt aaa ctt 
Leu Lys Leu 
325 

gat ate att 
Asp lie lie 
340 

tat tgc ate 
Tyr Gys lie 



aac aac act 
Asn Asn Thr 



gat tgg cct 
Asp Trp Pro 
390 

gca tat ttg 
Ala Tyr Leu 
405 

ate agt cac 
lie Ser His 
420 



200 

tat gga gga 
Tyr Gly Gly 
215 

caa gat eat 
Gin Asp His 



cea cag caa 
Pro Gin Gin 



ggt cga act 
Gly Arg Thr 
265 

caa age act 
Gin Ser Thr 
280 

get cag tea 
Ala Gin Ser 
295 

gga gtt ctg 
Gly Val Leu 



gga agt gga 
Gly Ser Gly 



age egt gtc 
Ser Arg Val 

345 

tct ggt ggt 
Ser Gly Gly 
360 

ctt gaa ttt 
Leu Glu Phe 
375 

aca aga gtg 
Thr Arg Val 



cat gaa gac 
His Glu Asp 



cct cge att 
Pro Arg lie 
425 



gca ccc tea 
Ala Pro Ser 
220 

gtg gtg aat 
Val Val Asn 
235 

cct gtg tct 
Pro Val Ser 
250 

get ata ccg 
Ala lie Pro 



ttc aca tac 
Phe Thr Tyr 



aat ttg eta 
Asn Leu Leu 
300 

cct agt ggc 
Pro Ser Gly 
315 

caa ggg gaa 
Gin Gly Glu 
330 

cat cat egt 
His His Arg 



caa aga ctt 
Gin Arg Leu 



cat ctt cat 
His Leu His 
380 

aag att gca 
Lys He Ala 
395 

tgt aag aaa 
Cys Lys Lys 
410 

ate cac aga 
He His Arg 



205 

ggt aat ggt 
Gly Asn Gly 



a.tg get ggt 
Met Ala Gly 



ggt cct cac 
Gly Pro His 
255 

tea cct caa 
Ser Pro Gin 
270 

gat gaa ctg 
Asp Glu Leu 
285 

gga caa gga 
Gly Gin Gly 



aaa gaa gtt 
Lys Glu Val 



cge gag ttt 
Arg Glu Phe 
335 

cat etc gtt 
His Leu Val 

350 

ttg gtt tat 
Leu Val Tyr 
365 

gga aag ggt 
Gly Lys Gly 



ttg gga tea 
Leu Gly Ser 



ate ttt ate 
He Phe He 
415 

gat ate aaa 
Asp He Lys 
430 



ggt 672 
Gly 



caa 720 

Gin 

240 

agt 768 
Ser 



get 816 
Ala 



tee 864 
Ser 



gga 912 
Gly 



gca 960 

Ala 

320 

caa 1008 
Gin 



tct 1056 
Ser 



gag 1104 
Glu 



egt 1152 
Arg 



get 1200 

Ala 

400 

tea 1248 
Ser 



get 1296 
Ala 
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gca aac att ctt ctt gat ttc agt ttt gag acc aag gtg gca gat ttt 1344 
Ala Asn lie Leu Leu Asp Phe Ser Phe Glu Thr Lys Val Ala Asp Phe 
435 440 445 

gga ttg get aag eta tct caa gac aac tat act cat gtc tec act egc 13 92 
Gly Leu Ala Lys Leu Ser Gin Asp Asn Tyr Thr His Val Ser Thr Arg 
450 455 460 

gtc atg gga act ttt gga tac tta get cca gag tat gca tea age gga 1440 
Val Met Gly Thr Phe Gly Tyr Leu Ala Pro Glu Tyr Ala Ser Ser Gly 
465 470 475 480 

aag tta tec gac aaa tct gat gtt ttc tea ttt gga gta atg ctt ctt 1488 
Lys Leu Ser Asp Lys Ser Asp Val Phe Ser Phe Gly Val Met Leu Leu 

485 490 495 

gag etc ata acc gga aga cct cct ctg gat eta act gga gaa atg gaa 153 6 
Glu Leu lie Thr Gly Arg Pro Pro Leu Asp Leu Thr Gly Glu Met Glu 
500 505 510 

gat age ttg gta gat tgg gca agg cct ttg tgt ttg aaa gca get caa 1584 
Asp Ser Leu Val Asp Trp Ala Arg Pro Leu Cys Leu Lys Ala Ala Gin 
515 520 525 

gat gga gat tac aac caa ttg get gat cca cgt eta gag eta aac tac 163 2 
Asp Gly Asp Tyr Asn Gin Leu Ala Asp Pro Arg Leu Glu Leu Asn Tyr 
530 535 540 



agt cat caa gag atg gtt caa atg get tct tgt gca get gca gca ate 
Ser His Gin Glu Met Val Gin Met Ala Ser Cys Ala Ala Ala Ala lie 
545 550 555 560 



1680 



aga cat tea gca aga aga egg cct aag atg age cag gtt caa aaa etc 1728 
Arg His Ser Ala Arg Arg Arg Pro Lys Met Ser Gin Val Gin Lys Leu 

565 570 575 

ata cca ctt gtt ggt tct att att gta ega gca eta gaa gga gat atg 1776 
lie Pro Leu Val Gly Ser lie lie Val Arg Ala Leu Glu Gly Asp Met 
580 585 590 

tea atg gat gat eta agt gag gga aca aga cca gga caa age aeg tac 1824 
Ser Met Asp Asp Leu Ser Glu Gly Thr Arg Pro Gly Gin Ser Thr Tyr 
595 600 605 

ttg age cec ggg age gtg age tea gag tat gac gca age teg tac acg 1872 
Leu Ser Pro Gly Ser Val Ser Ser Glu Tyr Asp Ala Ser Ser Tyr Thr 
610 615 620 

gca gac atg aaa aag ttc aag aaa ctg gcg tta gag aat aaa gaa tat 192 0 
Ala Asp Met Lys Lys Phe Lys Lys Leu Ala Leu Glu Asn Lys Glu Tyr 
625 630 635 640 

caa age agt gaa tat ggt gga aca agt gag tat ggc tta aac cct tct 1968 
Gin Ser Ser Glu Tyr Gly Gly Thr Ser Glu Tyr Gly Leu Asn Pro Ser 

645 650 655 

get tea agt agt gaa gaa atg aat aga ggc tea atg aaa egc aat cct 2 016 
Ala Ser Ser Ser Glu Glu Met Asn Arg Gly Ser Met Lys Arg Asn Pro 
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660 665 670 

cag ctt tga 2025 
Glh Leu 



<210> 14 
<211> 674 
<212> PRT 

<213> Arabidopsis thaliana 
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<210> 15 
<211> 2261 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 15 

catacatcac aaaacggcat cgttttgatg tcgctctctc cttcttcctc gccggctccg 60 
gcaacttccc cgcctgccat gtcattacct ccggcagatt ccgtacctga cacgtcatca 120 
cctccagctc ctcctttgtc tcctcttccc ccaccattga gctctcctcc gccgttgcct 180 
tcaccaccgc ctctctccgc tcccaccgct tccccaccgc ctcttccggt tgaatcccca 240 
ccgtctcctc ctatagaatc accaccgcct cctctactgg aatcacctcc tcctcctccg 300 
ttggaatctc catctccacc gtctcctcac gtctcagctc cttccggttc accgccatta 360 
cccttccttc ccgccaaacc ttctccgccg ccttcttcac ctccctccga gacagttccg 420 
ccgggaaata cgatttctcc accacctcgt tcacttccct ccgaatcaac cccgccggtg 480 
aacacagctt ctcctccacc gccatctcct cctcgccgcc gtagtggccc taagccttcg 540 



tttcctcctc ccatcaattc ttctccacca 
gaaacttctc ctccacctaa accaccgctc 
cccccgccta agaagtcccc tgcagcagta 
ttaccggatg ggaccgtagc acctcctatt 
gccgaatcaa tatctccggg aacgccacag 
acgtcgtatc accgatcatc cgccggattc 
ctactaattc tgttaggtct tctctttgtc 
aacagcagct ctgctcatca tcaatccaaa 
ggtaatgctg gtacgaacca ggcacatgtt 
tatatatcta gtggaggttg tgatacgaag 
atgccatctg gaatgttctc ctacgaagaa 
gagaaccttt tgggagaagg cggtttcgga 
acagaagttg cggtgaagca gctgaagatt 
gctgaggttg acacaatcag tagggttcat 
tgcgttaatg gagataaaag actcttggtt 
ttccacttgc atgagaacag aggaagcgtg 
gtaggagcag caaaaggatt agcttatctt 
cgtgatatca aagcagctaa tatccttcta 
tttggactag ccaagttttt ctcagacacc 
gtggtaggaa ctttcggata catggctcca 
aaatcagatg tatattcctt tggggtcgtg 
attttcgcca aagattcttc cacaaaccag 
acgaaagcaa tctctggaga aagttttgac 
tacgatacaa ctcagatggc aaacatggct 
gcttggcttc ggcctagaat gagccaggta 
agaaaggtcg aagagactgg gaatagcgtg 
atcacaccac ggtatggaac aaataagagg 
acttcagaat atggagttaa cccttctcag 
tagttcacag gttcaatagg gcaagtttca 



aatccttctc cgaacactcc gtcactccca 600 
tcaacgacgc catttccctc ctcatccact 660 
actcttcctt tctttgggcc agcgggccaa 720 
gggcctgtta tagagcccaa gacgagtcca 780 
ccactggttc cgaagagtct acctgtaacg 840 
ttatttggcg gtgtaatcgt tggagctctt 900 
ttctacagag ctaccagaaa tagaaataac 96 0 
actccctcaa aagttcaaca tcatcggggc 1020 
atcacaatgc caccaccaat ccatgctaaa 1080 
gagaacaatt ctgttgcgaa aaacatttca 1140 
ctttcaaaag caactggtgg attttcagag 1200 
tatgttcaca aaggagtgtt gaaaaacggg 12 6 0 
gggagctatc aaggggaaag agaattccaa 13 2 0 
cataagcacc tcgtttcatt ggttggttat 1380 
tacgagtttg ttcctaaaga taccttggag 1440 
ttggaatggg aaatgaggct caggattgct 1500 
catgaggatt gcagtccaac tataattcac 156 0 
gattccaaat ttgaggcaaa ggtctctgac 1620 
aattcatcat tcactcatat ctctactcga 1680 
gaatacgcgt ccagtggtaa agtaactgat 174 0 
cttctagaac tcatcactgg acgtccatca 1800 
agtttagtag actgggcgag gccattgctt 1860 
tttcttgtag actcaaggtt ggagaagaat 1920 
gcttgtgctg ctgcttgcat acgccaatca 1980 
gtacgtgctc ttgaaggcga ggtggccctg 2 04 0 
acctatagct cttctgaaaa cccgaatgac 2100 
agattcgaca caggttcaag cgatggttac 2160 
tcgagcagtg aacatcaaca ggtgaatact 222 0 
ccacaattat t 2261 



<210> 16 
<211> 2196 
<212> DNA 

<213> Arabidopsis thaliana 

<220> 

<221> CDS 

<222> (1) . . (2196) 

<400> 16 

atg teg etc tct cct tct tec teg ccg get ceg gca act tec ccg ect 48 

Met Ser Leu Ser Pro Ser Ser Ser Pro Ala Pro Ala Thr Ser Pro Pro 

15 10 15 

gee atg tea tta ect ccg gca gat tec gta cct gac aeg tea tea cct 96 
Ala Met Ser Leu Pro Pro Ala Asp Ser Val Pro Asp Thr Ser Ser Pro 
20 25 30 

cea get cct cct ttg tct cct ett cec cca eca ttg age tct cct ccg 144 
Pro Ala Pro Pro Leu Ser Pro Leu Pro Pro Pro Leu Ser Ser Pro Pro 
35 40 45 

ccg ttg cct tea cca ccg cct etc tec get cec ace get tec cca ccg 192 
Pro Leu Pro Ser Pro Pro Pro Leu Ser Ala Pro Thr Ala Ser Pro Pro 
50 55 60 



cct ctt ccg gtt gaa tec cca ceg tct cct cct ata gaa tea cca ccg 240 
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Pro Leu Pro Val Glu Ser Pro Pro Ser Pro Pro lie Glu Ser Pro Pro 
65 70 75 80 

cct cct eta ctg gaa tea eet eet cct cct ccg ttg gaa tct cca tct 288 
Pro Pro Leu Leu Glu Ser Pro Pro Pro Pro Pro Leu Glu Ser Pro Ser 

85 90 95 

cca ccg tct cct cac gtc tea get cct tec ggt tea ccg cca tta ecc 336 
Pro Pro Ser Pro His Val Ser Ala Pro Ser Gly Ser Pro Pro Leu Pro 
100 105 110 

ttc ctt ecc gee aaa cct tct ccg ccg cct tct tea cct ecc tec gag 3 84 
Phe Leu Pro Ala Lys Pro Ser Pro Pro Pro Ser Ser Pro Pro Ser Glu 

115 120 125 

aca gtt ccg ccg gga aat acg att tct cca cca cct cgt tea ctt ecc 432 
Thr Val Pro Pro Gly Asn Thr lie Ser Pro Pro Pro Arg Ser Leu Pro 
130 135 140 

tee gaa tea acc ccg ccg gtg aac aca get tct cct cca ccg cca tct 480 
Ser Glu Ser Thr Pro Pro Val Asn Thr Ala Ser Pro Pro Pro Pro Ser 
145 150 155 160 

cct cct cgc cgc cgt agt ggc cct aag cct teg ttt cct cct ecc ate 528 
Pro Pro Arg Arg Arg Ser Gly Pro Lys Pro Ser Phe Pro Pro Pro lie 

165 170 175 

aat tct tct cca cca aat cct tct ccg aac act ccg tea etc cca gaa 576 
Asn Ser Ser Pro Pro Asn Pro Ser Pro Asn Thr Pro Ser Leu Pro Glu 
180 185 190 

act tct cct cca cct aaa cca ccg etc tea acg acg cca ttt ecc tee 624 
Thr Ser Pro Pro Pro Lys Pro Pro Leu Ser Thr Thr Pro Phe Pro Ser 
195 200 205 

tea tee act ecc ccg eet aag aag tee eet gea gea gta act ctt cct 672 
Ser Ser Thr Pro Pro Pro Lys Lys Ser Pro Ala Ala Val Thr Leu Pro 
210 215 220 

ttc ttt ggg cca geg ggc caa tta eeg gat ggg ace gta gea cct cct 720 
Phe Phe Gly Pro Ala Gly Gin Leu Pro Asp Gly Thr Val Ala Pro Pro 
225 230 235 240 

att ggg cct gtt ata gag ecc aag acg agt cca gcc gaa tea ata tct 768 
lie Gly Pro Val lie Glu Pro Lys Thr Ser Pro Ala Glu Ser lie Ser 

245 250 255 

99^ cca cag cca ctg gtt ccg aag agt eta cct gta acg acg 816 

Pro Gly Thr Pro Gin Pro Leu Val Pro Lys Ser Leu Pro Val Thr Thr 

260 265 270 

teg tat cac ega tea tec gee gga ttc tta ttt ggc ggt gta ate gtt 864 
Ser Tyr His Arg Ser Ser Ala Gly Phe Leu Phe Gly Gly Val lie Val 
275 280 285 

gga get ctt eta eta att ctg tta ggt ctt etc ttt gtc ttc tac aga 912 
Gly Ala Leu Leu Leu lie Leu Leu Gly Leu Leu Phe Val Phe Tyr Arg 
290 295 300 



get acc aga aat aga aat aac aac age age tct get eat cat caa tec 960 
Ala Thr Arg Asn Arg Asn Asn Asn Ser Ser Ser Ala His His Gin Ser 
305 310 315 320 

aaa act eee tea aaa gtt caa cat cat egg ggc ggt aat get ggt aeg 1008 
Lys Thr Pro Ser Lys Val Gin His His Arg Gly Gly Asn Ala Gly Thr 

325 330 335 

aac cag gca eat gtt ate aca atg eca eca cea ate eat get aaa tat 1056 
Asn Gin Ala His Val lie Thr Met Pro Pro Pro lie His Ala Lys Tyr 
340 345 350 

ata tct agt gga ggt tgt gat aeg aag gag aac aat tct gtt gcg aaa 1104 
lie Ser Ser Gly Gly Cys Asp Thr Lys Glu Asn Asn Ser Val Ala Lys 
355 360 365 

aac att tea atg eca tct gga atg ttc tec tac gaa gaa ett tea aaa 1152 
Asn lie Ser Met Pro Ser Gly Met Phe Ser Tyr Glu Glu Leu Ser Lys 
370 375 380 

gca act ggt gga ttt tea gag gag aac ctt ttg gga gaa ggc ggt ttc 1200 
Ala Thr Gly Gly Phe Ser Glu Glu Asn Leu Leu Gly Glu Gly Gly Phe 
385 390 395 400 

gga tat gtt cac aaa gga gtg ttg aaa aac ggg aca gaa gtt gcg gtg 1248 
Gly Tyr Val His Lys Gly Val Leu Lys Asn Gly Thr Glu Val Ala Val 

405 410 415 

aag cag ctg aag att ggg age tat caa ggg gaa aga gaa ttc caa get 1296 
Lys Gin Leu Lys lie Gly Ser Tyr Gin Gly Glu Arg Glu Phe Gin Ala 

420 425 430 

gag gtt gac aca ate agt agg gtt cat cat aag cac etc gtt tea ttg 1344 
Glu Val Asp Thr lie Ser Arg Val His His Lys His Leu Val Ser Leu 
435 440 445 

gtt ggt tat tge gtt aat gga gat aaa aga etc ttg gtt tac gag ttt 13 92 
Val Gly Tyr Cys Val Asn Gly Asp Lys Arg Leu Leu Val Tyr Glu Phe 
450 455 460 

gtt ect aaa gat ace ttg gag ttc cac ttg cat gag aac aga gga age 144 0 
Val Pro Lys Asp Thr Leu Glu Phe His Leu His Glu Asn Arg Gly Ser 
465 470 475 480 

gtg ttg gaa tgg gaa atg agg etc agg att get gta gga gca gca aaa 1488 
Val Leu Glu Trp Glu Met Arg Leu Arg lie Ala Val Gly Ala Ala Lys 

485 490 495 

gga tta get tat ctt cat gag gat tgc agt eca act ata att cac cgt 153 6 
Gly Leu Ala Tyr Leu His Glu Asp Cys Ser Pro Thr lie lie His Arg 
500 505 510 

gat ate aaa gca get aat ate ctt eta gat tec aaa ttt gag gca aag 1584 
Asp lie Lys Ala Ala Asn lie Leu Leu Asp Ser Lys Phe Glu Ala Lys 
515 520 525 

gtc tct gac ttt gga eta gee aag ttt ttc tea gac acc aat tea tea 1632 
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Val Ser Asp Ph.e Gly Leu Ala Lys Phe Phe Ser Asp Thr Asn Ser Ser 
530 535 540 

ttc act cat ate tct act cga gtg gta gga act ttc gga tac atg get 1680 
Phe Thr His He Ser Thr Arg Val Val Gly Thr Phe Gly Tyr Met Ala 
545 550 555 560 

cca gaa tac gcg tec agt ggt aaa gta act gat aaa tea gat gta tat 1728 
Pro Glu Tyr Ala Ser Ser Gly Lys Val Thr Asp Lys Ser Asp Val Tyr 

565 570 575 

tec ttt ggg gtc gtg ett eta gaa etc ate act gga cgt cca tea att 1776 
Ser Phe Gly Val Val Leu Leu Glu Leu He Thr Gly Arg Pro Ser He 

580 585 590 

ttc gcc aaa gat tct tee aca aac cag agt tta gta gac tgg gcg agg 1824 
Phe Ala Lys Asp Ser Ser Thr Asn Gin Ser Leu Val Asp Trp Ala Arg 
595 600 605 

cca ttg ett acg aaa gca ate tct gga gaa agt ttt gac ttt ctt gta 1872 
Pro Leu Leu Thr Lys Ala He Ser Gly Glu Ser Phe Asp Phe Leu Val 
610 615 620 

gac tea agg ttg gag aag aat tac gat aca act cag atg gca aac atg 192 0 
Asp Ser Arg Leu Glu Lys Asn Tyr Asp Thr Thr Gin Met Ala Asn Met 
625 630 635 640 

get get tgt get get get tge ata egc eaa tea get tgg ctt egg cet 1968 
Ala Ala Cys Ala Ala Ala Cys He Arg Gin Ser Ala Trp Leu Arg Pro 

645 650 655 

aga atg age cag gta gta cgt get ett gaa ggc gag gtg gcc etg aga 2 016 
Arg Met Ser Gin Val Val Arg Ala Leu Glu Gly Glu Val Ala Leu Arg 
660 665 670 

aag gtc gaa gag act ggg aat age gtg ace tat age tct tct gaa aac 2064 
Lys Val Glu Glu Thr Gly Asn Ser Val Thr Tyr Ser Ser Ser Glu Asn 
675 680 685 

ccg aat gac ate aca cca egg tat gga aca aat aag agg aga ttc gac 2112 
Pro Asn Asp He Thr Pro Arg Tyr Gly Thr Asn Lys Arg Arg Phe Asp 
690 695 700 

aca ggt tea age gat ggt tac act tea gaa tat gga gtt aac cet tct 2160 
Thr Gly Ser Ser Asp Gly Tyr Thr Ser Glu Tyr Gly Val Asn Pro Ser 
705 710 715 720 

cag teg age agt gaa cat eaa cag gtg aat act tag 2196 
Gin Ser Ser Ser Glu His Gin Gin Val Asn Thr 

725 730 
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<212> PRT 
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<210> 18 
<211> 27 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: primer 
<220> 

<221> misc_f eature 
<222> (1) . . (27) 

<223> N(l) =i N(2) =i N(3) =T/c/A N(4) =i N(5) -A/T N(6) =T/C 
N(7) =A/g 

<400> 18 

ggnggtttcg gnatngtntt nnaangg 2 7 



<210> 19 
<211> 9 
<212> PRT 



<213> Brassica napus 
<220> 

<221> PEPTIDE 
<222> (1) . . (9) 
<223> X(l) =F/Y 

<400> 19 

Gly Gly Phe Gly He Val Xaa Lys Gly 
1 5 

<210> 20 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<220> 

<221> misc_f eature 
<222> (1) . . (22) 

<223> N(l) =i N(2) =i N{3) =T/g N(4) =i N(5) =i N(6) =A/g N(7) 
=A/g N(8) =T/c 

<400> 20 

aanatncnng ccatnccnaa nn 22 



<210> 21 

<211> 8 
<212> PRT 

<213> Brassica napus 
<400> 21 

Asp Phe Gly Met Ala Arg He Phe 
1 5 



<210> 22 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 22 

taaccaacaa gagaca 16 



<210> 23 

<211> 23 

<212> DNA 

<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: primer 



<220> 

<2 21> misc_f eature 
<222> (1) . . (23) 

<223> N(l) =g/A N(2) =i N(3) =g/A N(4) =T/C N(5) =l N(6) =l N(7) 
=A/g N(8) =1 N(9) =A/g N(10) =A/g 

<400> 23 

annannttng cnannccnaa ntc 23 



<210> 24 

<211> 8 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 24 

Asp Ph.e Gly Leu Ala Lys Leu Leu 
1 5 



<210> 25 
<211> 18 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: primer 
<400> 25 

ccactcccaa ctttcaac 1^ 



<210> 26 
<211> 27 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 

<400> 26 

ggaaagcttg catgcctgca ggtcgac 27 



<210> 27 
<211> 30 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: primer 
<400> 27 

cgcctgcagg taatacgact cactataggg 
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10 


zo 


30 


TTAACTCTCT 


GGTCTCCGTG 


TCTCCTCTCT 


70 


80 


90 


CATTTGCCTT 


T1*TGATTTAG 


AT CCT^AAGAA 


130 


140 


150 


ACTGGTTCGC 


CtCCATCTCC 


ACCATCAAAC 


190 


20O 


210 


GCTCCTCCTC 


CCACCACACC 


TTCTTCTCCT 


250 


260 


270 


CCTCCTTCTT 


CTCGCTCTAC 


ACCTTCTGCT 


310 


320 


330 


CCGGGATCTC 


CACCTCCTCT 


TCCXCAiGCCG 


370 


380 


390 


CCACCCGCAC 


CTGTTACTCC 


TCCTACTCGA 


430 


440 


450 


TCCAATCCTT 


CACG CGAAJGG 


AGGATCTCCT 


4 90 


soo 


510 


CCTTCTTCCG 


ACGGTTTATC 


AACAGGAGTG 


550 


560 


570 


CTGCTTGTGA 


TAG TGAC T C T 


gatttgtctc 


610 


620 


630 




ACTATGXTCC 


TCCCCCACCT 


670 


680 


690 


GGTGGACAGC 


AGCAACAATG 


gcggcaacaa 


730 


740 


750 


ACGTCACTAC 


CACCACCACC 


TAAGGCfCCA 


790 


800 


810 


CCACCGCCTT 


TCATGAGCAG 


CAGCGGCGGC 


850 


d60 


870 


CCACCGTCTC 


CAGGGCTTGT 


6TTAGGCTTC 


910 


920 


930 


GC T AGAGCCA 


CCAATGGTTT 


CT CCGAGGCG 


970 


980 


990 


GTGCACAAAG 


GTGTGTTGCC 


TAGTGGGAAA 


103O 


1040 


1050 


AGTGGT CAGG 


G AG AGAGG G A 


GT TTC AGGCA 


1090 


1100 


1110 


AGGCATCTGG 


TGTCTCTTGT 


TGGTTATT GC 


1150 


1160 


1170 


GAGTTTGTTC 


CTAACAACAA 


TCTCGAGCTT 


1210 


1220 


1230 


GAATGGA6CA 


CCAGATTGAA 


GATTGCTCTT 


1270 


1280 


1290 


6AAGAT76CA 


ATCCTAAAAT 


CATTCACCGT 


1330 


1340 


1350 


TTCAAGTTTG 


AAGCTAAGGT 


TGCTGATTTT 


1390 


1400 


1410 


ACGCATGTAT 


CAACACGTGT 


GATGGGAACC 


1450 


14 60 


1470 


AGCGCSAAAGC 


TCACGGAGAA 


GTCTGACGTT 


1510 


1520 


1530 



40 


50 


60 


TCTCCTGCTG 


CrrCCTTTTA 


ACACTCTCTT 


100 


xio 


120 


GCAGACATGT 


CCTCGGCGCC 


GTCTCCGGGG 


160 


170 


180 


TCCACAACCA 


CCACTCCTCC 


TCCA6CTTCC 


220 


230 


240 


CCGCCGCCAT 


CCACTATTCC 


GACATCTCCT 


280 


290 


300 


CCTCCTCCAT 


CTCCACCAAC 


TCCATCTACG 


340 


350 


360 


TCTCCACCCG 


CTCCAACTAC 


GCCCGGATCT 


400 


410 


4^0 


AACCCTCCAC 


CTTCAGTCCC 


AGGACCACCG 


4 60 


470 


480 


CGACCTCCAT 


CTTCTCCCTC 


GCCGCCGTCT 


520 


530 


540 


GTGGTGGGAA 


TCGCCATCGG 


AGGAGTCGCT 


580 


590 


600 


CTCTGTAAGA 


AGAAACGACG 


GAGAGACGAA 


640 


650 


660 


CCTCCTGGTC 


CCAAAGCCGG 


AGGACCTTAC 


700 


710 


720 


AACGCAACAC 


CACCGTCAGA 


TCATGTCGXG 


760 


770 


780 


TCTCCACCAC 


GGCAACCTCC 


TCCACCTCCA 


820 


830 


840 


TCCGACTACT 


CGGACCGTCC 


AGTTCTTCer 


880 


890 


900 


TCCAAAAGCA 


CTTTCACATA 


CGAGSA6CTA 


940 


950 


960 


AACTTGTTAG 


GACAAGGCGG 


GTTCGGTTAC 


1000 


1010 


1020 


GAAGTTGCTG 


TGAAGCAGTT 


GAAAGTTGGG 


1060 


1070 


' 1080 


GAGGTTGAGA 


TCATGAGCAG 


AGTTCACCAC 


1120 


1130 


1140 


ATCGCCGGTG 


CCAAAAGATT 


GCTTGTCTAT 


1180 


1190 


12O0 


CACCTCCAXG 


GCGAGGGACG 


GCCTACAATG 


1240 


1250 


1260 


GGATCTGCTA 


AAGGACTTTC 


TTAXCTTCAT 


1300 


1310 


1320 


GATATCAAGG 


CTTCAAACAT 


ATTGATAGAT 


1360 


1370 


1380 


GGTCTTGCTA AGATTGCTTC 


TGATACAAAC 


1420 


1430 


1440 


TTTGGGTACT 


TGGCTCCGGA 


ATACGCTGCA 


1480 


1490 


1500 


TTCTCATTTG 


GCGTTGTGCT 


TTTGGAGCTC 


1540 


1550 


1560 



Figure 1 {d> (concinued on next 



page) 



